^s 


Psychological Review 


Cuartes N. Corer, EDITOR 


Pennsylvania State University — . 


> VOLUME 73, 1966 


^ Heren ORR ELIZABETH MULLEN 
Managing Editor Editorial Assistant 
BIS 
Rosert D. GORMAN 
Henry S. SALMON Tl 
e 
Advertising and Business Manager Subscription Manager 


Pd 


BIMONTHLY BY THE 
AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 
PRINCE ann LEMON STS. ee 4 oe 
anp 1200 SEVENTEENTH ST. N. W., WASHINC ,D.C 


PUBLISHED 


JACK A. ADAMS 
Rozert E. ADAMSON 
Norman H. ANDERSON 
FRED ATTNEAVE 
GEORGE Bartow 
WILLIAM S. BarrERSBY 
FRANK A. BEACH 
D. E. BERLYNE 
Rocer W. BLACK 
Donacp BLoUGH 
WaiırLram H. Bocarrz 
Lye E. BOURNE 
Weston A. BOUSFIELD 
Gorpon H. Bower 
Jack W. BREHM 
Jupson S. Brown 
AnNoLD H. Buss 
E. C. CARTERETTE 
Dorwin CARTWRIGHT 
ALPHONSE CHAPANIS 
Loren J. CHAPMAN 
Burton H. Conen 
JAcon Conen 
M. R. D’Amato 
James DrEsE 
WirLiAM N. DEMBER 
Vicron H. DeNENBERG 
JoeL ELKES 
WILLIAM EPSTEIN 
CuanLEs W. ERIKSEN 
Donatp W. FISKE 
Joun H. FLAVELL 
Epwin A. FLEISHMAN 
Leo Ganz 
ELEANOR I. GIBSON 
ROBERT GLASER 
RıcHarD C. GONZALEZ 
JACQUELINE I. GoopNow 
ISIDORE GORMEZANO 
ALBERT E. Goss 
Davip M. GREEN 
G. ROBERT GRICE 


Editorial Consultants 


NORMAN GUTTMAN 
Jonn F. Hatt 
KeNNETH R. HAMMOND 
Harry F. Harrow 
Kerru J. Hayes 
RonznT C. Haycoop 
D. O. Hess 

R. J. HERRNSTEIN 
Rocer W. Heyns 
Wayne H. HOLTZMAN 
Fren E. HonvaTH 

J. McV. Hunt 

Leo M. Hurvicn 
Murray E, Jarvix 
James J. JENKINS 
Harry I. KALISH 
Lawrence KARLIN 
GEOFFREY KEPPEL 
WiLLiAM KESSEN 
GREGORY A. KIMBLE 
Jonn I. Lacey 

Roy Lacuman 
Herscuer W. Lemowrrz 
Seymour Levine 
Donatp J. Lewis 
Donatp B. Linpstey 
Frank A. Locan 

R. Duncan Luce 
RonzRr B. MacLrov 
Rosert B. MaLMo 
Epwin Martin 
Mervin H. Marx 


F. Josera McGuican 
ARNOLD MECHANIC 
ARTHUR W. MELTON 
RonrnT E. MILLER 
Haron E. MITZEL 

Jane S. Mouton 
Conran J. MUELLER 
Bennet B. MURDOCK, Jr. 
Jerome L. Myers 


LANCASTER PRESS, INC., LANCASTER, PA. 


J. NacHMIAS 
Turopore M. Newcome 
Kenneru New OGLE 
Davi S. PALERMO 
Rogert W. PAYNE 
Cart PFAFFMAN 
Howarp R. PoLLIO 
WirLram F. PROKASY 
ALLAN RECHTSCHAFFEN 
FRANK RESTLE 
Harrier L. RHEINGOLD 
BrNsow F, Rırrcnie 
Jurian B. Rorrer 
Ruporen W. ScnuLz 
Roserr SEEL 

Joun P. Sewarp 

Seru K. SHARPLESS 
Rocer N. SHEPARD 
Jerome E. SINGER 
Norman J. SLAMECKA 
RıcHARD L. SOLOMON 
KENNETH W. SPENCE 
CHARLES D. SPIELBERGER 
ARTHUR W. STAATS 
Error STELLAR 

Jonn Taros 

Joun W. TuisAuT 
Rican F, THOMPSON 
WARREN S. TORGERSON 
Tuomas Trarasso 
ENDEL TULVING 
ALLAN R. WAGNER 
Epwarp L. WALKER 
Nancy C. Waucu 
Wise B. Wess 
SDNEY WEINSTEIN 
Rosert W. Wuire 
Rura C. WyL 
Ricwarp P, Yourz 
PHILIP G. ZIMBARDO 
JOSEPH Zunrn 


a 


CONTENTS OF VOLUME 73 


Effects as a Function of Social Rein- 


Banai, Reunen M. Social Reinforcement 
orcement History 
Beckwitn, Mary, AND RESTLE, 
ep Joun W. Jr. See Tueros, JOHN. 
— NATHAN, AND OPPENHEIM, Paur, Tensions in Psychology between the 
" ethods of Behaviorism and Phenomenology 2 
ÓROEN, WILLIAM E., Jr. Response Disorganization an 
à in Schizophrenia. «-aecasiveneessnae eere tet ced yt oreanizaton’ TH 579 
ROEN, WiLLiIAM E, JR, AN ac 
Process Underlying a Schizo 

Broen, WitLiAM E, Jm. See STORMS, 
Cairns, Roverr B. Attachment Behavior Oo 
CAMPBELL, Byron A.„ AND JAYNES, JULIAN. 
CapALpr E. J. Partial Reinforcement: A Hypoth 
CARRINGTON, PATRICIA. See EPHRON, Harmon S. 
Conen, Bertram D. See ROSENBERG, SEYMOUR. 
Day, RuvH S. See Horowitz, Leonarp M. 


DzNixz, L. DOUGLAS. See SPIELBERGER, Cuartes D. 
Exsrrann, Bruce R, WALLACE, WiLLIAM P, AND UNDERWOOD, Benton J. A Fre- 


quency Theory of "Verbal-Discrimination Lenrning...«««« eee tnnt 
Exsrranp, Bruce R. See Unperwoop, BENTON Je 
Ernron, HARMON S, PATRICIA. 


AND CARRINGTON, 
BON itr] Homeostasis trae e S opas Ret MO UE 
Optical Texture versus Retinal Perspective: A Reply 


Reinstatement 
esis of Sequential Effects........ 450 


566 


365 


Games, PAUL A. Com 


Ganz, Lro. Mechanism * 
ffect of Set on Perception....-++++++++* 


On Remote Associations and the Inter- 
248 


Haxes, Davin T., AND Youns, 1 
i Experiments... A 


pretation o 
Hav, Joux C. Optical Motions 4 
son’s AC COROT orang irr da 
Horman, Eric w. Tests for Spontaneous Alter 
Horowitz, LEONARD M., NORMAN, SANDRA A., AND Day, 
Associative Symmetry... 
Irwin, Francis W. Criteria of wei iid y 
Irwin, FRANCIS w. Reply to Paul Thomas oung...-- E ; 
Jaxonovirs, LEON Mediation Theory and the “Single-Stage” S-R Model: ; 
pone fate A Ogee 3 
Byron A. 


Jaynes, Jutan. See CAMPBELL, 


Kessen, Wirrraat. See MuNSINGER: 
e Uncon 


HARRY. 
ditioned Response in Classical Conditioning 232 
ulsive Shock and Inhibi- 
388 


KrmmeL, H. D. Inhibition of tb 
Lewis, DONALD J. AND MAHER, BRENDAN Electroconv 
tion: Some Problems Considered... 77 
LienermMAN,, Lewis R. On * Actuaria Methods a 
cv aped e ELLA 
87 


Validation of Diagnostic Tests". v 
Loveyoy, EnijAm. Analysis of the Overlearning 


MAHER, BRENDAN A. See Lewis, DONALD J. 
Maxous, W. L. Cutaneous Color Sensitivity: Explanation and Demonstration.... 280 


L., AND PETRINOVICH, Lewis L. Neural Consolidation and Elec- 
382 


McGaucH, JAMES 
troconvulsive Shock: IRGXMHUREU xe tiene anne si aan dt nen seva 


iv CONTENTS 


McKee, Joun P. See Riley, Donatp A. 
Munsrncer, Harry, AND KesseN, WILLIAM. Stimulus Variability and Cogni- 
tive Change.....eseee Imm IH me HH emen 164 
Norman, Sanpra A. See Honowrrz, Leonarp M. 
OPPENHEIM, PauL. See Bropy, NATHAN. 
Parks, TuHeopore E. Signal-Detectability Theory of Recognition-Memory Per- 
pO" 44 
Pererson, Ltovp R. Short-Term Verbal Memory and Learning.................. 193 
Perrinovicu, Lewis L. See McGauaB, James L. 
Pracer, JEAN. Response to Brian Sutton-Smith..................... 
Pinneo, Lawrence R. On Noise da; tlie INécyous Syla aseene aaae 
Restie, FRANK. See BECKWITH, Mary. 
Ritey, DONALD A., SHERMAN, MARIAN, AN s 
mediate Size Discrimination and [419 a aer a, Comment on Inter- 
lesa AQUA iun MEME D. Ree Peces d Bales. 


RourrENBERG, AnvEH. Neural Mechanism : 3 " 
Formation Function s of Sleep: Changing View of Reticular 


SHERMAN, Marian. See Rirgv, DONALD A. 


SPIELBERGER, CHARLES D., AND DENiKE, L. Dovcras. 


sus Cognitive Theory in Verbal Operant Cieri Behaviorism ver- 
SPENCE Kenneth W. Cognitive and Drive Factors in th 'E Ebor taper be 
itioned Eye Blink in Human Subjects e Extinction of the Con- 


306 


Tuneros, JOHN, AND Bretsrorp, J Wi dX. Cate ioe woes ios acci pe mee 
DS; JOHN, Baetsronb, Jomw WA Jk, A Mac V Des see 
ru oning! Aniston tore ung Rati, Mag for CS or. a 
MPSON, ARLES P. gems BES MEMO 
Transfer Surfaces... qon: Incompatibility of the Houston and Osgood 
THowPsoN, RicHamD E, AND Sperone, WD It itttemteemtempeneteen nnper eene 586 
Ph s AN PENCE, Witersan Arc Dom qe terere 
Pie oed P tudy of Neuronal don, | uei opi A Model i 
ipsios se Ma » AND EKSTRAN NÍOR. cens gg 
i comings in the Interference Theory nm R. An Analysis of Some Short- 
Unpberwoop, Benton J. See EKSTRAND Bruce ie our venies arae reda as da TE 540 
— Witttam P. See Exstranp, Bruce "à 
EITZMAN, R. A. Statistical Learnin en 
rnin, " 
Wuaten, Ricnarp E. Sexual wake and Individual Differences... -+ 35 
WHEELER, Lapp. Toward a Theory of Beh REN 151 


Youxc, PauL Tuomas. Hedonic Organiza 
Youwc, Roperr K. See Hakzs, Dav T 


ZEILER, MicHAEL D. The Stimulus i 
S in the Intermedi 
late Size Problem 


Vor. 73, No. 1 


JANUARY 1966 


PSYCHOLOGICAL REVIEW 


AVAILABILITY AND ASSOCIATIVE SYMMETRY * 


LEONARD M. HOROWITZ, SANDRA A. NORMAN, AND RUTH S. DAY 


Stanford University 


This paper 
role in the cognitive processes. 
operationally defined by the prob 


15-sec. delay. (AV seems to grow fast 
It also grows, though not as fast, 
This definition is used to exa 


memory. 
without producing it.) 
of associati 
Sources of asymmetry 


cept's theoretical implications for m 


iguists, à word can 


According to lir 
daries in a sen- 


be defined by its boun 
tence. For Hockett (1958) a word is 
“bounded by successive points at which 
pausing is possible [p. 167].” Pauses 


never occur in the middle of a word; 


a pause would destroy the gestalt of 


the word, and a word's gestalt dis- 
tinguishes it from a mere collection of 
speech sounds. Thus, à brand new 
word might be analyzed into speech 
sounds, but after repetitions the word 
as a whole becomes psychologically 
primary. Its properties no longer de- 
pend on properties of the ingredient 
sounds. This shift from the sound as 
a unit to the whole word as a unit is 
a remarkable fact of language, @ fact 
which may help explain the versatility 
of memory and thought. The present 
authors assume that this shift is grad- 


ual and continuous. And the word is 
1This research was supported in part by 
e Grant 


United States Public Health Servic 
HD 00922-03. 


examines the concept of availability 
An item's availability (AV) is 
ability that S could recall it after a 


ve symmetry, and evidenci 
in natural language 
emory and thought are discussed. 


and its theoretical 


st when S produces it from 
when S sees the item 
mine the principle 
e is presented to support it. 
are examined, and the con- 


said to grow more and more “avail- 
able.” 

The term “availability” has been 
used in two different senses. Perhaps 
in this paper it should be called "item 
availability" because it is used to de- 
scribe a single verbal unit. Two re- 
cent authors have also used availability 
in this way. According to Under- 
wood, Runquist, and Schulz (1959), 
an available item is "readily re- 
callable"; for Asch and Ebenholtz 
(1962b) it is “accessible to recall.” 
Ttem availability denotes how readily 
1 unit comes to mind as a label 
how salient it is as a 
how often it occurs 


a verba 
for experience, 
unit of speech, or 
in verbal behavior. 

Availability, in the second sense, de- 
notes an associate's position in some 
response hierarchy. The higher the 
associate in the hierarchy, the more 
available it is said to be. Perhaps 
“hierarchy availability" would better 


convey this meaning. Hierarchy avail- 
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TABLE 1 


INITIAL MEASURES OF AVAILABILITY 


Test item Value of AV 
ZUMAP .00 
LATUK 04 
SAGROLE .04 
NEGLAN .08 
'TAROP .08 
QUIPSON 16 
GOJEY 20 
POLEF 24 
VOLVAP 24 
MEARDON .28 


ability resembles the concept of re- 
sponse strength, but the terms have 
slightly different meanings. Response 
strength usually describes the rein- 
forcement history of an S-R bond. 
An item's response strength does not 
change abruptly from time to time 
Hierarchy availability, on the other 
hand, connotes variability in an item's 
hierarchy position. Under different 
stimulating conditions, different re- 
sponses ascend. As one example, the 
hierarchy changes drastically if the 
stimulus is placed in a different con- 
text (Howes & Osgood, 1954). Or 
second, a given response in the hier- 
archy can be “primed” : If some irrele- 
vant stimulus elicits a response, that 
response becomes more probable later 
in free association. Also, as this ar- 
ticle shows, a response typically as- 
cends in a hierarchy whenever its item 
availability increases. 

Item availability is chosen as the 
more fundamental concept since it is 
independent of any particular response 
hierarchy. In general, an item grows 
more available as the subject has more 
experience with it. But the critical 
growth variables have not been known, 
so the concept has not been defined 
precisely. This article furst proposes 
an operational definition of item avail- 
ability, then shows how an item s avail- 


ability can be increased empirically, 
and, finally, examines the theoretical 
role of availability in the cognitive 
processes. 


AN OPERATIONAL DEFINITION 


A “test item” is defined as a collec- 
tion of letters whose availability was 
to be measured. Test items in the 
present studies were always two-syl- 
lable nonsense words. Each was 
printed on a 4 x 6-inch index card 
as the middle of three items. The 
items were arranged vertically, and the 
two extra ones comprised the "test 
frame.” Thus, the test item NEGLAN 
was embedded in a test frame to form 
the triad BALAP NEGLAN GO- 
KEM. All three items appeared in 
large capital letters, and the subject 
studied the display for 2 seconds. 

Then the card was reversed to ex- 
pose a three-digit number. The sub- 
ject repeated this number aloud and 
then started counting backwards by 
3s. A metronome was set for 120 
beats per minute, and the subject 
counted in rhythm with every other 
beat. After 15 seconds, the subject 
stopped counting and tried to recall 
the triad of items. Twenty-five sub- 
jects were tested in this way. The 
test item's availability (AV) was then 
defined as the proportion of the sub- 
jects who recalled it correctly after the 
delay. This procedure is based on the 
experimental method of Peterson and 
Peterson (1959). It follows Hunter 
(1913, pp. 52-62) in trying to meas- 
ure how well the nervous system sus- 
tains symbolic activity over time. 

Ten test items were measured in 
this way. They were all selected from 
Noble's (1952) list to have m values 
of 2.00 or less. Each test item was 
systematically embedded in 10 differ- 
ent test frames. A test item was as- 
signed to a different test frame from 
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one subject to the next, and the order 
of presentation was systematically var- 
ied. Resulting AV values are pre- 
sented in Table 1. 


How TO INCREASE AVAILABILITY 


The following experiment (Experi- 
ment I) was designed to increase the 
above values. The results show that 
an item’s AV grows fastest if the sub- 
ject produces the item from memory. 
Availability also grows, though not as 
rapidly, when the subject sees the 
item without producing it. 

First, the subject learned a PA task 
with four pairs: A-B, B-C, D-E, and 
F-E. In this task, B served a dual 
role; it was the stimulus of one pair 
and the response of another. Item E 
served two roles as well, the response 
of two pairs. The following diagram 
describes the associative relationship 
among items. An arrow in the dia- 
gram leads from a PA stimulus to a 
PA response. 

F 


4 


ABC DE 


this task was 


The vocabulary of 
For one group 


selected from Table 1. 
of subjects (Experiment Ja) the fol- 
lowing nonsense words were adopted : 
LATUK, A; QUIPSON, B; VOL- 
VAP, C; TAROP, D; POLEF, E; 
and MEARDON, F. For another 
group of subjects (Experiment Ib) the 
vocabulary was: LVAP, 
GOJEY, B; LATUK, C; POLEF, 
D; NEGLAN, E; and SAGROLE, F. 
Words were assigned to the two +. 

tasks so as to counterbalance initial 
item availabilities. Thus, the A 
Item A was .04 in Experiment Ia an 

.24 in Experiment Ib, averaging .^ 
Similar averages for other ingredients 
of the associative structure ranged from 


.14—.18. 


, 


V of 


3 


The four pairs were presented on a 
Lafayette memory drum. First, each 
pair of associates coappeared for 4 
seconds. On each later trial the sub- 
ject tried to anticipate the response. 
The stimulus appeared for 2 seconds, 
and the response then joined it for 
2 more seconds. The interval between 
pairs lasted 4 seconds; the interval be- 
tween trials, 12 seconds. The pairs 
were presented in four different or- 
ders, systematically varied from trial 
to trial. The subject tried to antici- 
pate the response before it appeared, 
and the procedure lasted 10 anticipa- 
tion trials. Twenty-five subjects were 
tested in each subexperiment. At 
least one perfect trial occurred some- 
where in the performance of 17 sub- 
jects in Experiment Ia and 16 subjects 
in Experiment Ib. 

After the PA task, each item’s 
availability was measured by the pro- 
cedure described above. Four control 
items which the subject had not seen 
before were added to make a batch of 
10 test items. In Experiment Ia, the 
control items were GOJEY, SAG- 
ROLE, NEGLAN, and ZUMAP; in 
Experiment Ib, TAROP, QUIPSON, 
ZUMAP, and MEARDON. 

The resulting AV values for all 
items are presented in Table 2. he 
highest value was only .84, so the 
items probably did not reach the level 
of bonafide words. Only one critical 


variable seemed to determine an item's 


AV level—namely, its role in the PA 
items are items 


task. “Produced” 
which the subject had produced from 
memory, "seen" items had just been 
stimuli, and “control” items had not 
appeared in the PA task at all. Aver- 
aged across the two experiments, pro- 
duced items earned a mean AV value 
of .72; seen items, .50; control items, 
18. Within each experiment the differ- 


ence between seen and produced items in 
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TABLE 2 


MEASURES OF AVAILABILITY AFTER PA ‘Task 


Experiment Ia 


| Experiment Ib 


PA role — mm = 
Item Value of AV. 
| — == 
Produced | QUIPSON (B) EM 
| VOLVAP (C) E 
POLEF (I) Ot 
Seen LATUK(A) EX 
TAROP(D) 10 
MEARDON (F) 76 
Control GOJEY Ag 
NEGLAN 12 
SAGOLE 08 
ZUMAP 12 


M Item Value of AV M 
| GOJEY (RB, 70 
75 | LATUK (C) T2 | 68 
| NEGLAN (E) 56 
| 
VOLVAP (A) 86 
56 | POLEF d» | (32 m 
SAGROLE (F) ET 
| MEARDON ET 
A1 | QUIPSON M 24 
TAROP 20 
ZUMAP (04 


mean AV was significant. In Experi- 
ment fa, z = 2.44, p < 02; in Experi- 
ment Ib, z = 2.96, p < .01. Likewise, 
the difference between seen and con- 
trol items was significant in each ex- 
periment. In Experiment Ia, z = 6.41, 
p < .001; in Experiment Ib, 2 2 279, 
P < .01. Item E, the response to two 
different stimuli, seems to have earned 
a slightly lower AV value than the 
other two produced items. This sur- 
prising result appeared in both experi- 
ments and needs to be examined fur- 
ther. 

Thus, in general, an item’s avail- 
ability increases when the subject pro- 
duces it from memory. Articulating 
an item is no more effective than look- 
ing at it (Underwood, 1964), so the 
subject must have to produce the item 
from memory to raise its AV. Pro- 
ducing the item from memory seems 
to unify the ingredient sounds into a 
larger gestalt. And this growth in 
AV does not depend on the item's 
associates. Thus, free, serial, or PA 
recalls would all heighten item avail- 
abilities. PA learning is especially 
interesting. since the subject only pro- 
duces the response from memory, and 


the response thus becomes more avail- 
able than the stimulus. PA stimuli 
grow more available, too, but accord- 
ing to Experiment I, not as greatly. 


PRINCIPLE OF ASSOCIATIVE 
SYMMETRY 


Availability deserves to be meas- 
ured because of a recent issue in the 
study of cognition. Asch and Eben- 
holtz (1962b) and Horowitz, Brown, 
and Weissbluth (1964) have shown 
that the verbal association process may 
differ fundamentally from classical 
conditioning. While a conditioned re- 
sponse is directional —a stimulus elicits 
a response—a verbal association may 
he symmetrical, According to this 
hypothesis, an association between two 
verbal units, A-D, is symmetrical if 
A and B are equally available. Equally 
available associates are supposed to 
elicit each other with equal probabili- 
ties. "Thus, an association. may be 
viewed as a pair of equipotent S-R 
bonds or perhaps as a union of two 
clements into a new gestalt. The au- 
thors find cither view tenable. 

The principle further asserts that an 
association exhibits direction if the 


AVAILABILITY AND ASSOCIATIVE SYMMETRY 5 


associates differ in availability. If so, 
the less-available associate seems to 
clicit the more-available one, but not 
Phenomenally, then, the 
association would resemble a condi- 
tioned response. In PA learning, es- 
pecially, the associates differ in avail- 
ability, and the less-available stimulus 
would seem to elicit the more-avail- 
able response. According to the prin- 
ciple, however, the latent backward 
association could be made overt by any 
operation that increased the stimulus 


vice versa. 


availability. 

To test the principle of associative 
symmetry, Horowitz, Brown, and 
Weisshluth (1964) prepared a PA 
task that contained six pairs of non- 
The stimuli of some 
of other pairs, 
avail- 


sense words. 
pairs were responses 
so these stimuli became highly 
able. (One pair, for example, was 
denoted C-E; its stimulus served as 
a response in the pair A-C and thus 
available.) After the 
PA task, he was 
ociations to each 
vocabulary. 


became highly 
subject mastered the 
asked for two free ass 
the task's 
ere prepared to 
item elicited 


ingredient of 
Response hierarchies W 
show how often every 
every other item as associate. 

In every hierarchy the strongest Te- 
sponses were available ones. The 
most popular associate any item elic- 
ited was an available one which had 
coappeared with it during learning. 
The "direction" of pairing. though. 
was unimportant. The association 
C-E, for example, was supposed to be 


symmetrical. Therefore, the proba- 
E in free as- 


bility that C elicited í 
sociation was compared with the 
elicited C. These 


probability that E 


probabilities did 
. The former was 4 


cantly : 
39. The subject had also 
The probability that E 


not differ signifi- 
1, the latter. 
learned a 


pair, E-G. i 
elicited G was compared with the 
probability that E elicited C. These 


figures did not differ significantly 
either: The former equaled .42, the 
latter, .39. Comparisons like these 
strongly supported the principle of as- 
sociative symmetry. 

When item availabilities did differ, 
the data revealed marked directionali- 
ties. The stimulus of A-C was less 
available since A was not produced. 
The probability that A elicited C in 
free association was therefore com- 
pared with the probability that C elic- 
ited A. Whereas the former proba- 
bility equaled .49, the latter was only 


12: 
A REFINED REPLICATION 


These comparisons were all com- 
parisons within subjects. Horowitz, 
Brown, and Weissbluth (1964) did not 
e on a single association to vary 
just its directionality in different sub- 
The following experiment (Ex- 
achieved this goal and 
above results. This 
d the free associa- 


operat 


jects. 
periment II) 
also replicated the 


experiment compare 
tions of two experimental groups whose 


learning tasks differed slightly. Their 
associative structures are diagramed 
below to describe each group’s PA 
task. Again, an arrow in the diagram 
leads from a PA stimulus to its PA 
Each task contained five 


response. 
ates with four items 


pairs of associ 
serving as responses. 
Condition 1 
p E 
{ l 
A—BoCAF 
Condition 2 
E 
l 
A>B>C>F D>X 


The major difference focused on 
Item A, which became more available 
in Condition 1. From the principle of 
associative symmetry, the backward 
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association B-A should only have ap- 
peared in Condition 1. In Condition 1 
alone its free-association incidence was 
to equal the incidence of A-B and B-C. 
B-C and C-F, though, should have 
been symmetrical in both experimental 
conditions. 

An association was also expected 
to develop between Items A and E. 
These items were both stimuli for B 
during learning.  Kjeldergaard and 
Horton's (1960) "stimulus equiva- 
lence" paradigms would predict such 
an association, and so would the prin- 
ciple of induction described below. 
Condition 2 did not make either asso- 
ciate available, though, so both direc- 
tions of this association were expected 
to stay latent. Condition 1, however, 
allowed E to elicit the more available 
A and thus reveal a directional asso- 
ciation. 

Twenty-five subjects were assigned 
to each condition. First each subject 
learned a PA task with nonsense 
words as the vocabulary (Set N). 
The nonsense words were selected 
from Noble's (1952) list to have m 
values of 2.00 or less. They were: 
VOLVAP, A; TAROP, B; QUIP- 
SON, C; MEARDON, D; NEGLAN, 
E; GOJEY, F; and LATUR, X. 

After the subject mastered the 
learning task, he received a booklet 
for his free associations. Each page 
contained one item of the task, and 
this item appeared on two consecutive 
pages. The subject was instructed to 
supply two associations to every item. 
A hierarchy of associations was then 

compiled for each item to show all of 
the associations and their probabilities 
of occurrence. 

Then the subject learned another 
PA task based on the same associative 
structure. This task's vocabulary was 
composed of low-frequency English 
words (Set W). These words were 
also selected from Noble's (1952) list 


to have i values of 2.00 or less. They 
were: TUMBRIL, A; XYLEM, B; 
RENNET,C;ATTAR, D; BODKIN, 
E; WIDGEON, F; and DAVIT, X. 
After the subject mastered this PA 
task, another word-association test was 
administered, and response hierarchies 
for Set W were compiled. 

Pairs of associates were presented 
on a Lafayette memory drum. The 
stimulus was exposed alone for 2 sec- 
onds, and then the stimulus and re- 
sponse coappeared for 2 seconds. A 
4-second interval separated pairs, and 
the intertrial interval lasted 12 sec- 
onds. Four different orders of presen- 
tation were adopted, and the task con- 
tinued to a criterion of two successive 
perfect trials. 

The principal data concern the sub- 
ject's responses on the association test. 
Each subject produced two associa- 
tions per item in each set. An item's 
response hierarchy is then defined as 
its frequency distribution of responses, 
A response hierarchy contained 50 re- 
sponses; for convenience the response 
frequencies are reported as propor- 
tions. The value of r between corre- 
sponding proportions of Set N and 
Set W was .89 for Condition 1 and 
-90 for Condition 2. Because of the 
Strong agreement between Sets, corre- 
sponding values were averaged. 

The stimulus of each hierarchy is 
called the hierarchy's title to avoid 
theoretical misunderstandings. Here- 
after, each experimental result is la- 
beled by a pair of letters: The first 
letter denotes the hierarchy's title, and 
the second, one of its associates. Thus, 
for example, the probability of AC 
refers to the proportion of times A 
elicited C on the association. test. 
Table 3 reports all of these proba- 
bilities. 

Backward associations were popu- 
lar when the associates were both 
highly available. BA, for example, 
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TAB 


LE 3 


RESPONSE HIERARCHIES FROM FREE-AsSOCIATION TEST 


Condition 1 


Condition 2 


Proportionate frequency Proportionate frequency 
of occurrence of occurrence 
Title | Responses scu NE Title | Responses 
Set N Set W | M Set N Set W M 
A B 40 40 40 B C 38 | 40 | .39 
C 30 | 46 | 23 E EE 30 | 2 
D da p^ = \ A2 A2 i12 
n x . dl a 2 
E || [m siala 
B c 34 .38 36 D .08 .06 .07 
: g p. E: č F 40 | 40 | .40 
E “ae ‘s | 08 B 36 | 42 | .39 
E 06 10 .08 x lt 06 | .10 
E .06 | .08 | .07 
C lu ay (| suo a8 A 02 | .04 | .03 
n 32 34 3d D 02 | .00 | .01 
\ 22 10 16 p 
D ‘os | .06 | -07 D x 0) |i Sez |) s 
E 02 10 06 B 24 .20 22 
: M F 40 | 10 | -10 
D A H, M = € .08 -10 09 
B 28 A EU E ‘os | 0 | 0 
E pt 2 | .09 A 02 | 04 | 03 
E .02 02 02 E B 50 Ad AT 
; AT F Am | x30 | 2 
E R B a p c 08 | 18 | -13 
F qa | dh | old x 48 | .08 | 13 
C o8 | 42 |, 20 D 08 | .06 | .07 
D 04 (04 | -04 A ‘ot | .04 | .04 
; 5 Ad n š c ag | 42 | 45 
E 5 E SU || d i B 20 | 35 | 328 
A 06 AS A X 24 .08 16 
E pg | 08 | 07 E ‘a2 | 40 | .06 
D 04 10 07 D 06 | .00 | .03 
pi A 00 | 04 | 02 
Condition 2 " D E 38 36 
4 Es 6 30 28 
R " " 32 AT B 36 | 4 28 
2 24 46 .20 € 46 44 AS 
> 00 ‘08 em F 3€ | ae | B 
5 706 10 .08 E 40 | -02 06 
D .06 40 .08 A 04 04 04 
E 02 4 | 0 
ar ly from the probability of BC 
was very probable in Conaion i : 36). nor from the probability of AB 
where A was highly available; 1S 40). The former 5 — .67, the lat- 
was C In Condi- 


probability, averaged across sets, 


32. This figure did not differ signifi- 


ter, 1.18; both p’s > 20. 
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tion 2, though, where A was less avail- 
able, the probability of BA was .12, a 
significant reduction from its value in 
Condition 1; z= 3.41, p < .001. 

In both experimental conditions 
Items B, C, and F were highly avail- 
able. "Therefore, the incidences of BC 
and CF were expected to equal those 
of CB and FC, the backward associa- 
tions. The probability of BC was .36 
in Condition 1, in Condition 2, .39. 
'The corresponding probabilities of CB 
were .33 and .39. Similarly, the prob- 
ability of CF was .38 in Condition 1 
and .40 in Condition 2. The corre- 
sponding probabilities of FC were .47 
and .45. No pair of differences here 
was statistically significant, all s's < 
1.29, p 2.10. The slightly greater 
incidence of FC (a backward associa- 
tion) may be due to F's unique posi- 
tion in the associative structure ( Horo- 
witz, Brown, & Weissbluth, 1964). 

An induced association is defined as 
an association that develops between 
two items which never coappeared 
during learning. Since this term is 
neutral theoretically, it is preferred to 
“mediation.” The principle of induc- 
tion asserts that an association þe- 
comes induced between two items 
which have shared a common linguistic 
context (Horowitz, Brown, & Weiss- 
bluth, 1964). Of course, the induced 
association only becomes overt when 
one associate or both are highly avail- 
able. This principle explains how para- 
digmatic associations develop (Mc- 
Neill, 1963). It tells why a high- 
frequency adjective becomes associated 
with its polar opposite, even though 
the two words do not usually coappear 
in the language. 

An induced association was expected 
between Items E and A. This asso- 
ciation was only to be evident when 
A was available. And, indeed, the 
probability of FA was .25 in Condi- 
tion 1 but only .04 in Condition 2. 


The difference was statistically signifi- 
cant, z= 422, p< 001. Further- 
more—and this is important —EA was 
more probable than EC in Condition 
1,2—278, p< 01. While EA had 
satisfied the conditions for induction, 
EC had not. 


RESPONSE LATENCIES 


Asch and Lindner (1963) have im- 
plied that a response's latency de- 
pends on its availability. If the two 
associates are equally available, their 
two directions of recall should have 
equal latencies. A subject is said to 
perform a "forward recall" of an asso- 
ciation, A-B, when he observes A and 
supplies D. He performs a “back- 
ward recall" when he observes B and 
supplies A. If A and B are equally 
available, the forward and backward 
recall latencies should be equal. Ex- 
periment III was designed to examine 
this hypothesis. 

The associative structure of Condi- 
tion 1 above was adopted with Set 
W's vocabulary. Twenty-six subjects 
learned this PA task to a criterion of 
one perfect trial ; the procedure exactly 
replicated the one described above. 
Then cach subject was tested for for- 
ward and backward recall. He was 
given a card that contained one word 
of Set W and a blank space next to 
it. Sometimes the blank appeared to 
the word’s left, sometimes to its right. 
The subject had to say which word 
belonged in the blank. Thus, every 
stimulus and every response was pre- 
sented alone once, and the subject had 
to supply its associate. The card with 
Item B in the response position had 
two correct answers because of the 
two learned pairs, A-B and E-B. 
When this card appeared, the subject 
was asked to give both correct an- 
Swers. 

Each subject could produce at most 
10 errors; the average subject, how- 
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ever, only produced 1.32 errors (33 
errors in all). Four errors were omis- 
sions, three were intrusions, and the 
rest were internal errors. An internal 
error is a response from the task’s 
vocabulary which the subject had pro- 
duced at the wrong time. Three of 
the pairs were supposed to contain 
equally available associates: A-B, B-C, 
and C-F. These pairs incurred 0, 7, 
and 3 forward errors, respectively, and 
2, 6, and 2 backward errors. Hence, 
the total number of forward errors 
exactly equaled the total number of 
backward errors. The pairs D-A and 
E-B, however, which were supposed 
to be directional, incurred 3 and 1 for- 
ward errors, respectively, but 5 and 
4 backward errors. 

Response latencies are presented 
pair by pair. D-A was supposed to 
be directional, so @ subject who did 
recall both associates was expected to 
recall A faster. Nineteen subjects re- 
called both associates correctly, and 


14 subjects recalled A faster. The 
mean latency for D was 2.30 seconds. 
The differ- 


that for A, 1.83 seconds. 

ence was statistically signi 

= 2.14, p < 05. 
Since B-C and 


ficant, t (18) 


C-F were supposed 
to be symmetrical, the forward and 
backward recalls were not to have dif- 
fered in latency. The results confirmed 
this hypothesis. Eighteen subjects re- 
called both associates of B-C; 10 of 
them recalled the forward faster, and 
8 recalled the backward faster. The 
mean latency of B's recall was 3.79 
seconds and that of C’s recall, 3.11 
seconds. The difference was not sta- 
tistically significant; t (17) = 1.11, 
p> .10. Twenty-one subjects reca 
both associates of C-F, and only 9 of 
them recalled the forward faster. 
mean latency of C's recall was 1.91 
seconds and that of F's recall. 2.33 
seconds. This difference was not sta- 


tistically significant either; ? (20) = 
—1.40, p > .10. f 

. The two remaining pairs, A-B and 
E-B, introduced a special complica- 
tion. When B was displayed for back- 
ward recall, A and E were both possi- 
ble answers, and the subject tried to 
supply them both. The authors ex- 
pected the subject to recall A and B 
with equal ease, but to take longer on 
E. (A-B was symmetrical, E-B was 
not) This hypothesis was not con- 
firmed. The competition in backward 
recall may have camouflaged a latency 
difference between A and E. Nine- 
teen subjects supplied both backward 
recalls; 9 of them recalled A first and 
10 recalled E first. The latencies of 
A or E, whichever occurred first, were 
recorded, and the mean A and E la- 
tencies of this listing were compared. 
The mean A latency was 3.57 seconds ; 
ean E latency was 2.92 seconds. 
This difference was not statistically 
significant ; ¿< 1.00. The mean of 
the entire listing, 3.26 seconds, was 
significantly greater than the corre- 
sponding B latencies of the forward 
recalls; F (2, 72) = 3.44, p< 05. 
The B latencies of. A-B's forward re- 
call averaged 2.54 seconds, that of 
E-B’s forward recall, 2.10 seconds. 


the m 


OTHER CAUSES OF DIRECTIONALITY 


These results generally support the 
principle of associative symmetry, 
but the principle poses further prob- 
For example, most words of 
everyday English are so highly avail- 
able that their AV values cannot differ 
asymmetrics emerge 
in word-association norms: BEAUTIFUL 
strongly elicits WOMAN in the Minne- 
sota norms (Russell & Jenkins, 1954), 
but woman scarcely elicits BEAUTIFUL ; 
AFRAID strongly elicits DARK, but DARK 
does not elicit AFRAID. The authors 
believe that the subject’s "set" may 


lems. 


materially: still, 
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create these asymmetries, and several 
sources of set are examined next. 

First, the subject’s task instructions 

may create directionalities. The sub- 
ject may be instructed (or he may in- 
struct himself) to supply, say, descrip- 
tive associates. Then wicGHT would 
strongly elicit DARK—not so much 
through verbal habits as through the 
subject's view of reality. There prob- 
ably are no instructions, however, 
which can demand NIGHT so uniquely 
as a response to DARK. 

Verbal behavior is always con- 
strained in this way. If the task in- 
structions do not specify constraints, 
then the rules of English grammar do. 
Thus, grammatical rules define a sec- 
ond set that may influence word as- 
sociations. The subject has learned 
transitional probabilities from one 
grammatical class to another in spoken 
English, so he responds to an adjec- 
tive with a noun and to a noun with 
à verb. Associations may still be sym- 
metrical for practical matters of mem- 
ory, but the set to be grammatical in 
free association may obscure these 
symmetries, 

The authors have tried to simulate 

a grammatical set experimentally, but 
the attempt failed. The procedure of 
Experiment II was adopted, and the 
subject learned five pairs of associates. 
Each pair was embedded in meaning- 
ful sentences to give each associate 
grammatical form. The pair, VOL- 
VAP-TAROP, for example, appeared 
as “A VOLVAP often TAROPS,” 
“A VOLVAP is TAROPING,” and so 
on. The sentence frames varied from 
trial to trial, but VOLVAP always 
stood as a noun and TAROP asa verb. 
Then free associations to every item 
were collected. Directional associa- 
tions were expected in place of former 
symmetries. The response hierarchies, 
though, did not differ from those of 
Table 3 in any significant respect. 


Third, an item’s context may pro- 
duce a set for directional associations. 
“Context” denotes other verbal condi- 
tions which enhance some associate’s 
probability of occurring (Howes & 
Osgood, 1954). Say Items A and B 
elicit each other with some small prob- 
ability. When A appears in Context 
C, though, it may elicit B more 
strongly. In other words, the com- 
pound stimulus, A -- C, would elicit 
B more readily than A alone. On the 
other hand, contexts may not exist 
which could make B elicit A that 
strongly. So the compound stimulus 
would be a source of directionality. 

The compound stimulus may key- 
note the way serial learning becomes 
directional. The earlier "chaining hy- 
pothesis" of serial learning asserted 
that each item of a serial list comes to 
elicit the next item. This hypothesis, 
however, has been amply refuted (e.g.. 
Young, 1959). The principle of asso- 
ciative symmetry counters it still fur- 
ther: Since all serial items become 
available, the chaining hypothesis im- 
plies symmetrical associations between 
each pair of adjacent items. Item n 
would elicit items 4 — 1 and n+ 1 
with equal ease. "Thus, serial learning 
ought to be much more arduous than 
itis. Although the serial task is still 
puzzling, the items that precede a 
given item seem to serve as a collec- 
tive stimulus for it (Horowitz, Lipp- 
man, Norman, & McConkie, 1964). 
K alone does not elicit L in the alpha- 
bet, but a compound of preceding let- 
ters does. This hypothesis explains 
why one can recite the alphabet for- 
wards so much more easily than back- 
wards. 

GENERAL IuPLICATIONS 


The concepts of availability and as- 
sociative symmetry help organize the 
data of human memory with appealing 
simplicity. The following implications 
illustrate this simplicity, 
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Meaningfulness and Familiarization 


An item's availability must correlate 
with its meaningfulness (7) and with 
its exposure frequency (7). According 
to Noble (1952), a word's m grows as 
it acquires more associates; its 1 grows 
with each additional exposure. Of 
course, the operations that increase 5 
and n also increase AV. Empirical 
research must still decide whether 7, 
n, and AV are experimentally separa- 
ble, but in this discussion they are as- 
sumed to be correlated at least. 

Conclusions about availability can 
thus be applied to discussions of m 
and n, and vice versa. A PA task de- 
mands available responses, SO it be- 
comes easier when its responses are 
more meaningful or more familiarized. 
Several authors have already shown 
the critical importance of response 
meaningfulness ( Cieutat, 1959; Cieu- 
tat, Stockwell, & Noble, 1958; Hunt, 
1959; Kothurkar, 1963; Morikawa, 
1959; and Weiss, 1958). These au- 
thors have examined different values 
of stimulus m combined with different 
values of response 7. The learning 
rate was greatest when the responses 
were meaningful. In particular, some 


tasks made the subject associate a 
low-m item with a high-m item; the 
when the 


task was always easier 
high-m item served as response. 

Response familiarity would seem to 
facilitate PA learning by making the 
responses more available. Experi- 
mental evidence, though, is equivocal. 
Part of the ambiguity is due to the 
welter of unstandardized procedures 


for increasing n (Bailey & Jeffrey, 
1958; Cieutat, 1960; Gannon & po 


1961; Mandler & Campbell, 
Saltz, 1961; Schulz & Tucker, 1962; 
Weiss, 1958). Some experimenters 
have had the subject study the famil- 
iarized items briefly (eg. Cieutat, 
1960). Others have had him produce 


the items repeatedly by free recall 
(Horowitz & Larsen, 1963). Some- 
times familiarization has introduced a 
special interference component into the 
subject's task (Horowitz & Larsen, 
1963) and could camouflage the ad- 
vantage of gained availability. 


Backward Associations 


Asch and Ebenholtz (1962a) have 
reviewed the data which show how 
prevalent backward associations are. 
Although forward recall usually sur- 
passed backward recall, the responses 
in these studies had usually become 
more available than the stimuli. Given 
that backward recall occurs, though, it 
should occur most readily for available 
stimuli. Jantz and Underwood (1958) 
and Hunt (1959) have confirmed this 
claim. Hunt varied stimulus and re- 
sponse m in a PA task. First, the 
subject learned the PA task to mas- 
tery. Then each response was pre- 
sented alone, and the subject had to 
supply its stimulus. Backward recall 
was best when stimulus m (hence 


stimulus AV) was high. 


Paired-Associate Learning 


Underwood et al. (1959) have ana- 
lyzed paired-associate learning into two 
theoretical stages. One stage con- 
cerns the response's availability and the 
other, its association with the stimu- 
lus. The present results further testify 


to the power of their analysis. 
Underwood et al. (1959) have not 
specified whether the response must 
be available to enter an association. 
The present authors believe not and 
assert that the stages are independent. 
Suppose two items, A and B, co- 
appear in some verbal context, and the 
subject does not produce them. The 
items would become associated, but 
both directions of the association would 
remain latent. The phenomenon would 


resemble latent learning. 
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'fhe authors argue that item avail- 
abilities mainly affect performance. If 
A or B later grew in availability (after 
the association was formed), the A-B 
association would emerge in perform- 
ance. This analysis would explain 
why performance is so much better on 
a recognition task than on a recall 
task. A recognition task provides the 
response alternatives and maximizes 
their availabilities. 

Experiments with humans do not 
usually distinguish between learning 
and performance factors. But an as- 
sociation cannot emerge in the sub- 
ject’s behavior unless the response is 
available enough. Latent associations 
which exist in the nervous system 
might be called “unconscious” and 
might still affect the subject's behav- 
ior: They might contribute to the asso- 
ciative meaning of some concept, help 
shape some attitude, or have clinical 
importance, as described helow. 


Psychotherapy 


Certain associations between un- 

labeled associates are especially promi- 
nent in psychotherapy. Let A and B 
denote two cognitions—A, one that 
develops when the subject perceives 
some subtle interpersonal relationship ; 
and B, one that develops when the 
subject experiences an unpleasant feel- 
ing. The association A-B would then 
link one of these perceptions with one 
of these feelings. Perhaps, for exam- 
ple, a child perceives a subtle exchange 
of hostilities between his parents (A). 
Through identifying and through em- 
pathy, his perception comes to arouse 
an indescribable mixture of anxiety 
and anger (B). Such perceptions 
usually remain unlabeled (hence, un- 
available), so ^ would be an unavail- 
able stimulus. The child could not 
identify the source of his emotional 
discomfort since B, the feeling, does 
not elicit A, its source. 


A psychotherapist might eventually 
hope to get a patient's feeling “worked 
through," extinguished, or modified by 
discrimination training. But as the 
first step, the stimulus must be identi- 
fied. As long as it is not available, the 
behavior cannot be modified. Thus, a 
therapist has to heighten availabilities 
—a prerequisite to any therapeutic 
strategy he might later adopt. Also, 
according to this paper’s view, the pa- 
tient should he the one who labels the 
perceptions, Otherwise, the avail- 
abilities stay less than maximal. 


Transfer 


Positive transfer occurs when some 
task demands an association that the 
subject has already learned through 
contiguity or contextual similarity. 
Performance would remain imperfect, 
though, until all responses were avail- 
able enough. "Thus, some task might 
demand an association that had already 
been induced, but this association would 
remain latent until the response grew 
more available. 

This interpretation can explain dif- 
ferences among some transfer para- 
digms. Horton and Hartman (1963) 
had the subject learn three PA tasks 
composed of low-frequency five-letter 
words. The first task was denoted 
A-B. the second task, B-C, and the 
third task was a mixture of A-C and 


C-A. The A-C pairs of the third 
task showed the greater transfer, 
Horton and Hartman inferred that 


“backward associations" are less effec- 
tive mediators, but an alternate view 
exists: The first two tasks induced an 
association between A and C. Since 
Ttem C had been a response, it was the 
more available. And «o the induced 
association was directional—from A. 
the less-availahle item, to C. The 
writers believe that equal transfer 
would have ensued if A and C had 
been equally available. 


AVAILABILITY AND ASSOCIATIVE SYMMETRY 13 


McCormack (1961) studied transfer 
between PA tasks of high-frequency 
words. First the subject learned a 
task denoted A-B, then one denoted 
C-B, and finally one denoted A-C. 
The resulting transfer was positive, so 
the authors viewed backward associa- 
tions as effective mediators. But the 
stimuli of this experiment, high-fre- 
quency words, were available to start 
with; thus, the induced association 
A-C was overt and symmetrical. The 
writers believe that the transfer to C-A 
would have been equally positive. 

Response competition causes nega- 
tive transfer, but the two competing as- 
sociates may emerge simultaneously. 
Murdock (1958) had three experi- 
mental groups learn an A-B task. Each 


group then transferred to a second 


task—one group to A-C, another to 
All groups 


B-C, a third to C-A. 
egative transfer on the sec- 
and the interference was 
worst to C-A. (In Murdock’s data 
the difference did not reach statistical 


significance, but in Goulet and Bar- 
did.) 


clay's, 1965, replication it 
Some explanation is still needed for the 
gross interference from A-B to C-A— 


since C, it scems, had just one associ- 
d C-A, 


ate. When the subject learne 
the A-B stimulus would have become 
more available, and the A-B associa- 
tion would have become symmetrical. 


Thus, B and C were both eliciting A, 
hence inducing an association between 
ciation, C-B. 


them. This induced asso 
would have then competed with C-A 
to produce the observed interference. 
These principles can also help in- 
terpret recent results of Keppel and 
Underwood (1962) on backward as- 
(Their results were repli- 
cated by Kausler & Kanoti. 1963.) 
First the subject learned an A-B list to 
one perfect trial. A denotes a nonsense 
syllable and B. an adjective. Then the 
subject spent 15 trials learning a sec- 


displayed n 
ond task, 


sociations. 


TABLE 4 


Data or KEPPEL AND UNpERWOOD (1962) 
ORDERED AND [NTERPRETED 


Second | YS" | ISB free of | Does A stay 

task | recall Secate | available? 

- None | 5.80 Yes Unknown 
- MC 5.05 Yes Yes 
- C-D | 4.60 Yes No 
A-B | A-Br | 3.40 No Yes 
A-B | C-B 1.80 No No 


ond task; four second tasks were ex- 
amined. Finally, the subject was 
tested for backward recall of the first 
task. He was shown each response, 
and he had to supply its stimulus. 
The results of this experiment are 
presented in Table 4 according to the 
subject’s second task. Table 4 man- 
ages to rank order these results by 
establishing two facts in Columns 4 
and 5. Column 4 tells whether B was 
free of other associates. The progno- 
sis was poor when B did appear in the 
second task. Column 5 tells whether 
A kept being available during the sec- 
ond task. Item A clearly did stay 
available if it appeared in the second 
task. Thus, the net prognosis was 
best for backward recall when that 
recall followed A-C; it was worst after 
C-B. If greater weight is attached to 
Column 4, Keppel and Underwood's 
(1962) results make sense. 
"Availability" may help relate re- 
search in verbal learning to general 
psychology. Brown and Lenneberg 
(1954) have shown that a color whose 
name is more “codable” is better re- 
membered. Their study was designed 
to test Whorf's hypothesis, but its in- 
terpretation was partly confounded. 
The most codable color names (high- 
frequency words) are probably used to 
label the most frequent sensory ex- 
periences. Thus, word frequencies 
and sensation frequencies probably co- 
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vary. And Brown and Lenneberg's 
(1954) result may reflect a perceptual 
henomenon instead of a linguistic one. 
Codability seems to be related to avail- 
ability, though, so a color's name might 
be made more codable by the methods 
of this paper. Thus, these methods 
might help to clarify and evaluate 
Whorf’s hypothesis. 
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HABITUATION: 


A MODEL PHENOMENON FOR THE STUDY OF 
NEURONAL SUBSTRATES OF BEHAVIOR? 


RICHARD F. THOMPSON axp WILLIAM ALDEN SPENCER? 
University of Oregon Medical School 


The recent habituation literature is reviewed with emphasis on neuro- 
physiological studies. The hindlimb flexion reflex of the acute spinal 
cat is used as a model system for analysis of the neuronal mechanisms 
involved in habituation and sensitization (ie, dishabituation). Ha- 
bituation of this response is demonstrated to follow the same 9 
parametric relations for stimulus and training variables characteristic 
of behavioral response habituation in the intact organism. Habituation 
and sensitization appcar to be central neural processes and probably do 
not involve presynaptic or postsynaptic inhibition. It is suggested that 
they may result from the interaction of neural processes resembling 


"polysynaptic low-frequency depression,” and “facilitatory afterdis- 


charge." 


habituation. 


Perhaps the most fundamental prob- 
lem in physiological psychology is the 
elucidation of neuronal mechanisms 
underlying alterations in behavior. 
This implies prediction of behavioral 
changes from descriptions in terms of 
patterns of synaptic interactions among 
identifiable populations of neurons. 
Two general strategies in vogue today 
are (a) correlational studies relating 
changes in various neuronal responses 
of the intact organism to changes in 
behavior (cf. Morrell, 1961), and (5) 
analytic studies of relatively long-term 
changes induced in simplified neuronal 
systems such as the monosynaptic re- 
flex (cf. Eccles, 1964). At present, 
phenomena measured in the intact or- 
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"Membrane desensitization" may play a role in long-lasting 


ganism cannot be analyzed in terms 
of neuronal mechanisms. ^ Analytic 
studies, while utilizing simpler systems, 
generally deal with changes whose rela- 
tionships to alterations in the behavior 


of the intact organism are not always 
clear. i 
The solution to this impasse would 
seem to be a type of relatively long- 
lasting and nontrivial alteration in be- 
havior as a result of training which has 
identical characteristics both in the in- 
tact organism and in simplified prep- 
arations, where some degree 
neuronal analysis is possible, 


has been a resurgence of 
recent years 


of 
There 
interest in 


nt y in the phenomena of 
habituation, due in part to studies of 


neural response habituation (eg. 
Iernández-Peón, Jouvet, & Scherrer, 
1957; Sharpless & Jasper, 1956). Be- 
havioral response habituation appears 
to Satisfy both. the requirements of 
behavioral Significance and occurrence 
in simplified Preparations. A wide 
variety of responses of the intact or- 
ganism habituate (Harris, 1943), as 
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di reflex responses of the chronic 
P rem pe (Prosser & Hunter, 

sos ent information has been 
developed about the neural organiza- 
tion of the spinal cord to permit some 
degree of analysis of reflex mechanisms 
(e.g. Eccles, 1964). This paper will 
discuss recent studies of habituation 
with emphasis on neurophysiological 
findings. We will attempt to demon- 
strate that habituation of the spinal 
flexion reflex is identical in characteris- 
tics to habituation of more complex 
responses of the intact organism and is 
at least. partially amenable to analysis 
in terms of neuronal mechanisms. 
Original data to be reviewed were all 
obtained using reflex activity of a 
“model” mammalian system: the acute 


spinal cat. 
DEFINITIONS 

Harris (1943), in his careful and 
comprehensive review of the earlier 
literature, defined habituation opera- 
tionally to be "response decrement as a 
result of repeated stimulation | [P- 
385]." Humphrey’s (1953) earlier 
definition, although expressed in less- 
behavioral terms, is substantially the 
Response decrement resulting 
apid stimulation (e.g. 500/ 
ight be due to invasion 


same. 
from very r 
second ) which m 


of the neuronal relative refractory 
period has different characteristics and 
are decrements 


is best excluded, as 
resulting from trauma, 
Habituation can 


growth, aging, 
usually be dis- 


etc. 

tinguished from the latter phenomena 

because it is reversible: Habituated 
cov- 


responses exhibit spontaneous re 
ery. The only authority tO diverge 
from Harris’ characterization of habit- 
uation is Thorpe (1950), who regards 


“relatively permanent waning 


it as a 

of a response as à result of bo. 
"ie ie 

Smelatign [p. 94; fts ours]. t 

course fo! 


he time 


will be seen that t 
1 habituated 


spontaneous recovery of al 


response depends on a great many 
variables; hence distinctions in terms 
of recovery time would seem somewhat 


arbitrary. Certainly Harris’ definition 
has gained widespread | acceptance. 


Several authors have suggested a dis- 
tinction between “short-term” and 
“long-term” habituation processes 
(Shapless & Jasper, 1956: Sokolov, 
1955). 

Two peripheral mechanisms have 
often been assumed either explicitly or 
implicitly to account for response de- 
crement: alterations in receptors or in 
effectors. These may be given more 
specilic operational definitions which 
permit empirical evaluation. In this 
paper, a decrease in receptor activity 
will be called receptor adaptation, and 
a decrease limited to the effector re- 
sponse (including both effector and 
neuroeffector junction) will be termed 
effector fatigue. Following Harris 
(1943), response decrements that can 
be accounted for entirely by these 
mechanisms will not be considered 
habituation. 

lí a common-denominator d 
of learning such as “change in behavior 
under conditions of practice” is adopted, 
habituation must be included as an 
aspect of learning. Habituation can be 
distinguished from many phenomena ot 
learning in that the change in behavior 
is a decrease in response strength. It 
is customary, however, to assume that 
the response exhibiting habituation is 
itself unlearned (Harris, 1943). 
James (1890) and Konorski (1948) 
have proposed a more inclusive term, 

abituation, condi- 


plasticity, to include h ) 
aspects of behavioral 


efinition 


tioning, and other l 
change. Our general definition of 
pply to a variety 


also a 
adaptation, accommo- 


inhibition, negative 
stimulus satiation, 
4ll be used here to 
term the reader 


habituation may 
of other terms: 
dation, fatigue. 
learning, extinction, 
etc. Habituation W 
stand for whichever 
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prefers. Martin (1964) has recently 
proposed that such diverse constructs 
as response habituation, evolutionary 
adaptation, and photochemical dark 
adaptation have a common conceptual 
basis. The utility of this approach is 
unclear. 


PARAMETRIC CHARACTERISTICS OF 
HABITUATION 


The extensive research on behavioral re- 
sponse habituation in the intact vertebrate 
has demonstrated a number of parametric 
relations for stimulus and training variables, 
Although not previously enumerated as such, 
examples of all of these may be found in the 
earlier literature for various reflex responses. 
Some of the more recent Observations, par- 
ticularly those dealing with types of re- 
sponses not covered in Harris’ (1943) re- 


view, will be noted in the brief summary 
that follows. 


l. Given that a particular stimulus 
elicits a response, repeated applications 
of the stimulus result in decreased re- 
Sponse (habituation). The decrease is 
usually a negative exponential function 
of the number of stimulus presentations, 


Examples of Tesponse habituation can 
probably be found in essentially all behay- 
loral studies where a stimulus is regularly 
presented. In earlier experiments devoted 
to habituation, per S6, parametric character- 
istics were studied for a variety of responses 
(cf. Harris, 1943) ranging from Postrotatory 
nystagmus (Griffith, 1920; Wendt, 1951) to 
startle (Prosser & Hunter, 1936) and gal- 
vanic skin response (GSR—Davis, 1934), 
With the possible exception of the 


“knee 
jerk” reflex (Lombard, 1887; Prosser & 
Hunter, 1936) habituation was a consistent 


finding, usually exhibiting an exponential 
course. The recent habituation literature 
contains many investigations of various re- 
sponses which can be subsumed under the 
general heading of response to novel stimu- 
lation, ranging in complexity from simple 
startle through variously defined orienting 
reflexes to curiosity, stimulus satiation, and 
play behavior (Berlyne, 1950, 1960; Bindra, 
1959; Butler & Harlow, 1954; Davis, Buch- 
wald, & Frankmann, 1955; Dember, 1961; 
Dykman, Reese, Galbrecht, & Thomasson, 
1959; Engen & Lipsitt, 1965; Glanzer, 1953; 


Melzak, 1961; Montgomery, 1953; Robinson 
& Gantt, 1947; Rodgers, Melzak, & Segal, 
1963; Scholander, 1960; Sokolov, 1960; 
Welker, 1956, 1961). Interestingly, if a 
single novel stimulus is repeated regularly, 
habituation of approach behavior follows an 
exponential course (Glanzer, 1953). Un- 
learned behavior sequences of birds elicited 
by certain categories of stimuli have also 
been found to habituate under some condi- 
tions (Hinde, 1954; Thorpe, 1956). Finally, 
much of the extensive rescarch on response 
decrement or fatigue under conditions of 
work (Bartley & Chute, 1947; Robinson, 
1934; Solomon, 1948) can be considered 
within the framework of habituation. 


2. If the stimulus is withheld, the 
response tends to recover 


over time 
(spontaneous recovery). 


Spontaneous rec 
of the studies not 


be the most common method of demonstrat- 
ing that a given res: 


Ontaneous recovery is 
ariables and 


of a given 
startle re- 


3. If repeated series of habituation 
training and Spontaneous recovery are 
gen, habituation becomes successively 
More rapid (this might be called poten- 
tation of habituation). 


Humphrey (1933) noted this effect in his 


studies on turtle leg withdrawal 

1 to shell tap. 
Konorski (1948) descri the adr 
Ing response, and i 
many studies where repeated i i 
Series were given (eg. oe 


4. Other things being equal, the 


frequency of stimula- 
rapid and/or more 


pronounced is habituation, 
3 
wat S. Brown, personal communication, 


i 


| J 
2 
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Numerous examples of this were noted in 
the earlier reflex studies (Harris, 1943) as 
well as in more recent work on stimulus 
satiation and curiosity (Glanzer, 1953; 
Welker, 1961). The effect occurs in terms 
of real time course and occurs within certain 
limits in terms of number of trials as well. 


5. The weaker the stimulus, the 
more rapid and/or more pronounced is 
habituation. Strong stimuli may yield 
no significant habituation. f 


This relationship is characteristic of most 
types of responses ranging from simple re- 
flexes (Harris, 1943) to complex explora- 
tory behavior (Welker, 1961).4 


6. The effects of habituation training 
may proceed beyond the sero or asymp- 
totic response level. 


Additional habituation training given after 
the response has disappeared or reached a 
stable habituated level will result in slower 
recovery. Although relatively few experi- 
ments have studied “below-zero” habituation 
as such (Humphrey, 1933; Prosser & 
Hunter, 1936; Wendt, 1931), the observa- 
tions may be viewed as an extension of the 
relationship between number of stimulus 
presentations and degree of habituation. 
Zero response level is of course to some 
degree dependent upon the particular re- 
sponse measures used. 


7. Habituation of response to a 
given stimulus exhibits stimulus gener- 
alization to other stimuli. 
demonstrated generaliza- 
ation to different types 
n, and Porter (1938) 


dal generalization of 
for light and tone 


Coombs (1938) 
tion of GSR habitu: 
of auditory stimulatio 
demonstrated cross-mo' 


the habituated GSR 
stimuli. Mowrer (1934) showed some gen- 


eralization of postrotatory nystagmus p 
ation in the pigeon. In a recent study, 
Crampton and Schwam o apaa 
generalization of optic nystagmus s itual ion 
in the cat to different degrees of angu 
acceleration. 


eee ins 
4 Postrotary optic nystagmus may be an 


exception in that under some conditions the 
de is of habituation is directly related to 
A of rotation (G. Crampton, persona 


communication, 1 


8. Presentation of another (usually 
strong) stimulus results in recovery of 
the habituated response (dishabitua- 
tion). 


This phenomenon appears to be as ubiqui- 
tous as habituation itself and is commonly 
used to demonstrate that habituation has oc- 
curred. Pavlov (1927) was perhaps the first 
to describe this process (i.e, disinhibition) 
in relation to an extinguished conditioned re- 
sponse (CR), but also applied it to the 
habituated orienting response. Humphrey 
(1933) studied dishabituation extensively in 
lower vertebrates. Essentially all responses 
of mammals that can be habituated can also 
be dishabituated (Harris, 1943). It is not 
always necessary for the dishabituatory stim- 
ulus to be strong. In fact Sokolov (1960) 
and Voronin and Sokolov (1960) reported 
that a decrease in the intensity of an auditory 
stimulus results in dishabituation of the ha- 
bituated orienting response in humans. Dis- 
habituation, viewed as neutralization of the 
process of habituation (Humphrey, 1933), 
has been perhaps the most important method 
of distinguishing between habituation and 
“fatigue.” 


9. Upon repeated application of the 
dishabituatory stimulus, the amount of 
dishabituation produced habituates 
(this might be called habituation of 
dishabituation). 


Most studies of dishabituation (see above) 
have noted its habituation. Lehner (1941) 
has done the most careful parametric studies, 
showing that habituation of dishabituation 
follows a negative exponential course for the 
startle response in the rat and the abdominal 
reflex in man. More recently, Hagbarth and 
Kugelberg (1958) and Hagbarth and Finer 
(1963) verified and extended Lehner’s find- 
ings for the abdominal and leg flexion re- 
flexes in humans. Crampton and Schwam 
(1961) have shown that dishabituation of 
postrotatory nystagmus in the cat by audi- 
tory or cutaneous stimuli habituates in a 


similar fashion. 


In reviewing the behavioral habitua- 
tion literature, it is striking to find 
virtually complete agreement on the 
parametric characteristics of the phe- 
nomenon in such a wide variety of 
animals and responses. These nine 
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common characteristics may conse- 
quently serve as the detailed opera- 
tional definition of habituation, replac- 
ing the more general definition given 
above. The extent to which any other 
response decrements satisfy these char- 
acteristics will thus determine whether 
they can be called habituation. 


NEUROPHYSIOLOGICAL STUDIES OF 
HABITUATION 


Electroencephalogram (EEG) Arousal 


The EEG arousal response—a rapid 
transition from slower waves to fast 
low-voltage activity, usually following 
sudden stimulation—was observed to 
decrease upon repeated stimulation in 
early studies both in animals (eg., 
Rhineberger & Jasper, 1937) and man 
(Knott & Henry, 1941). The first 
investigation of this effect, per se, was 
the now classic study by Sharpless and 
Jasper (1956). Using repeated pres- 
entations of brief tones they found that 
cortical EEG arousal of the sleeping cat 
becomes shorter and finally disappears. 
After cessation of stimulation, the 
arousal response exhibited Spontaneous 
Tecovery over a period of minutes to 
hours. Although not studied para- 
metrically, examples of all nine rela- 
tions listed above for behavioral re- 
Sponses are to be found here and in 
subsequent confirmatory studies (Cas- 
pers, Lerche, & Greuter, 1958; Roig & 
Sommer-Smith, 1959; Segundo, Roig, 
& Sommer-Smith, 1959), Apelbaum, 
Silva, Frick, and Segundo (1960) 
noted a more restricted generalization 
gradient when using waking cats than 
Sharpless and Jasper found with sleep- 
ing animals. The human “alpha block- 
ing” response has been shown to habit- 
uate to tactile, auditory, and visual 
stimulation (Sokolov, 1960; Voronin & 
Sokolov, 1960). Cortical EEG arousal 
also exhibits habituation, spontaneous 
recovery, and  dishabituation when 


elicited by electrical stimulation of the 
mesencephalic reticular formation 
(Glickman, 1958; Glickman & Feld- 
man, 1961; Roig & Sommer-Smith, 
1959). Habituation of electrographic 
arousal has also been reported for thal- 
amus and reticular formation (Sharp- 
less & Jasper, 1956), hippocampus 
(Green & Arduini, 1954), and olfac- 
tory bulb (Adrian, 1950; Hernández- 
Peón, Lavin, Alcocer-Cuarón, & Mar- 


celin, 1960). 


Sensory Evoked Potentials 


A number of investigators have re- 
ported habituation of click- or tone- 
evoked responses in various regions 
of the central nervous System (CNS) 
ranging from the cochlear nucleus to 
the auditory cortex and mesencephalic 
reticular formation in the waking ani- 
mal (Galambos, Sheatz, & Vernier, 
1956; Gershuni, Kozhevnikov, Maru- 
seva, Avakyan, Radionova, Altman, & 
Soroko, 1960; Hernández-Peón et al, 
1957; Hernández-Peón & Scherrer. 
1955; Jouvet & Hernández-Peón. 
1957). Worden and Marsh (1963) 
have emphasized the importance of 
controlling stimulus intensity, demon- 
strating Statistically reliable changes in 
response, and conducting other neces- 
Sary contro] Procedures which were not 
always observed in the studies listed 
above. Experiments Satisfying all 
these methodological criteria have not, 


, 1961; harpless & Jasper, 1956; 
However 
en (1964) have re- 
decreases in click- 
ron áà amplitudes from the 
auditory cortex when the EEG was de- 
synchronized, even in the absence of 
any consistent Tesponse changes at the 


reported 
evoked response 
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cochlear nucleus. In a recent and care- 
ful study, Webster, Dunlop, Simons, 
and Aitkin (1965) reported a rapid 
initial decrease in the amplitude of 
sound-burst-evoked responses of the 
cochlear nucleus that occurred during 
the first 30 seconds of stimulation, if 
the sound (20-millisecond duration) 
was presented 1/second or faster. No 
decreases occurred with slower rates 
of stimulation. (The experiments de- 
scribed above generally presented stim- 
uli slower than 1/second and examined 
responses over periods of many min- 
utes or hours.) Auditory evoked re- 
sponses of the reticular formation do 
appear to habituate in the sleeping ani- 
mal (Huttenlocher, 1960). Auditory 
evoked cortical association responses 
do not habituate (Shaw & Thompson, 
1964) ; in fact the amplitude of this re- 
sponse increases during the first few 
presentations of a click (Thompson & 
Shaw, 1965). Head position in the 
sound field is a powerful and consistent 
determiner of evoked-response ampli- 
tude (Marsh, Worden, & Hicks, 
1962). Contractions of middle-ear 
muscles have been implicated in some 
studies of auditory evoked response 
habituation (Guzman-Flores, Alcaraz, 
& Harmony, 1960; Hugelin, Dumont, 
& Paillas, 1960), but denied by other 
workers (Altman, 1960; Moushegan, 
Rupert, Marsh, & Galambos. 1961). 
Further discussion of methodological 
problems is given in the careful studies 
by Marsh et al. (1962) and Worden 
and Marsh (1963). (See also the dis- 
cussion by Deutsch, 1962.) —/ 
Many reports have noted habituation 
of evoked responses from various re- 
gions of the visual system to repeated 
flash stimulation (Cavaggioni, Gian- 
nelli, & Santibañez, 1959; Hernández- 
Guzmán-Flores, Alcaraz, & 


Peón, mh 
Fernandez-Guardiola, 1958; Manei, 
Meulders, & Santibañez, 1959; Pales- 


tini Davidovich, & Hernández-Peon, 
j 


1959). Pupil diameter apparently was 
not controled in many of these early 
studies. When the eye is atropinized 
and an artificial pupil used, the evi- 
dence for visual evoked response habit- 
uation is inconsistent. Fernandez- 
Guardiola, Roldán, Fanjul, and Cas- 
tells (1961) found no habituation of 
responses to flash under these condi- 
tions recording from the optic tract and 
lateral geniculate body and inconsistent 
changes from the visual cortex: Some 
preparations showed partial decreases, 
particularly of the later components of 
the evoked response, and others showed 
no decreases. Naquet, Regis, Fischer- 
Williams, and  Fernandez-Guardiola 
(1960) noted that flash-evoked re- 
sponses of the visual cortex decreased 
when pupil diameter increased (con- 
comitant with EEG  desynchroniza- 
tion), but did not change when an arti- 
ficial pupil was used. Affanni, Man- 
cia, and Marchiafava (1962) reported 
no reduction in responses of the visual 
cortex to a flash repeatedly presented 
for hours when an artificial pupil was 
used. On the other hand, Garcia- 
Austt (1963) reported decreases in 
scalp averaged evoked responses to 
flash in humans with artificial pupils, 
and decreases of averaged visual 
evoked responses from the rat after re- 
moval of the iris. Responses evoked 
by somatic sensory stimulation have 
been reported to habituate (Hernán- 
dez-Peón, 1960;  Hernández-Peón, 
Scherrer, & Velasco, 1956), and to 
exhibit no habituation (Santibañez, 


Trouche, & Albe-Fessard, 1963). 


Conclusion 

The evidence for EEG arousal habit- 
uation is clear and consistent; it ex- 
hibits essentially all parametric rela- 
tions characteristic of behavioral re- 
sponse habituation. Studies of evoked 
responses, on the other hand, permit 
only one safe generalization at present : 
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There is as yet no consistent agreement 
regarding the occurrence of habituation 
in sensory systems of the waking brain. 
Where habituation has been reported, 
it may be the result of decreases in ef- 
fective stimulus intensity due to altera- 
tions in head position, middle-ear 
muscle contractions, pupilary constric- 
tion, etc, or an indirect result of 
changes in state of EEG arousal. The 
issue can only be settled by further 
research. 


EFFECTS or NEURAL Lesions on 
BEHAVIORAL RESPONSE HABITUATION 


Although the majority of studies de- 
scribed here were not planned as para- 
metric investigations of the effects of 
lesions on behavioral response habitua- 
tion, per se, many were so designed 
that such effects could he noted. The 
orienting response exhibits markedly 
reduced habituation following chronic 
total decortication in dogs (Konorski, 
1948; Lebidinskaia & Rosenthal, 1935) 
and chronic decerebration in cats 
(Bard & Macht, 1958). Lesions of 
auditory cortex in the cat reduce the 
extent of intersession habituation of 
orientation to sound (Thompson & 
Welker, 1963). Activity level of mon- 
keys with Cortical Area 9 lesions ex- 
hibits less habituation than that of nor- 
mal animals (French, 1959; French & 
Harlow, 1955). Frontal or temporal 
lesions in monkeys reduce the amount 
of visual exploration and decrease the 
extent of habituation in repeated test- 
ing sessions (Butler & Harlow, 1954; 
Symmes, 1959, 1963). Frontal lesions 
have a comparable effect in the cat 
(Hagamen, Lance, & Ungewitter, 
1959). Damage to frontal cortex de- 

creases the extent of habituation of the 
heart-rate response to the cold-pressor 
test in man and the rat (Glaser & Grif- 
fin, 1962; Griffin, 1963). Schwartz- 
baum, Wilson, and Morrissette (1961) 
report less-rapid habituation to novel 


stimuli following bilateral amygdalec- 
tomy. Pribram (1963) reports unpub- 
lished results by Kimble and Bagshaw 
showing comparable effects of amygda- 
lectomy on GSR response to novel 
stimulation and emphasizes the import- 
ance of this structure in habituation. 

Response alternation or variability in 
maze behavior can be related to habit- 
uation of response to novel stimuli (see 
below; also Glanzer, 1953; Welker, 
1961). Partial cortical lesions in the 
rat reduce the tendency to vary move- 
ment through a simple maze (Krechev- 
sky, 1937a, 1937b, 1937c). Frontal 
lesions are more effective than parietal 
or occipital lesions (Morgan & Wood, 
1943). Roberts, Dember, and Brod- 
wick (1962) found that bilateral abla- 
tion of the hippocampus resulted in less 
habituation of T-maze responses and 
less alternation behavior. 

Results of these studies, involving 
various responses and many different 
neural structures, are in essentially 
complete agreement. Brain damage 
tends to produce a reduction in the 
extent of behavioral response habitua- 
fion. There is some degree of spe- 
cificity in that damage to structures 
related to particular responses tends to 
have a greater effect on habituation. 
It would seem that habituation may 
be mediated, at least in part, by what- 
ever neural structures are most in- 
volved in a particular response, sug- 
gesting that common neuronal mechan- 
isms may be involved. 


SPINAL REFLEXES 


Sherrington noted a decrement of 


the digital flexion of the monkey in 


1898, but his experiments on the acute 


(1906) were the first to 
a : habituation of Spinal reflexes 
in detail. Although he used the term 
fatigue, he was Careful not to imply 
any necessary meaning other than re- 
sponse decrement by the term, He 


spinal dog 
analyze h 


A 


IL 


. found 
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that continuous or repeated 
mechanical or electrical stimulation of 
the appropriate skin region resulted in 
a decrement of the scratch reflex and 
the flexion reflex, the former decreas- 
ing more rapidly. Both exhibited 
spontaneous recovery. Several lines of 
evidence suggested that both receptor 
adaptation and muscle fatigue could be 
ruled out. Indirect evidence from a 
variety of reflex-interaction studies led 
Sherrington to conclude that the 
“final common path,” that is, the motor 
neurons, were not fatigued. Prosser 
and Hunter (1936) completed a care- 
ful and extensive study of habituation 
and dishabituation of the flexion reflex 
in the chronic spinal rat, using elec- 
trical or mechanical stimulation of the 
legs, tail, and hindlimb nerves. They 
demonstrated habituation, spontaneous 
recovery, and dishabituation by strong 
extraneous stimulation. Weak stimuli 
led to more rapid habituation than 
strong stimuli. In one experiment 
they showed that no habituation oc- 
curred in the knee-jerk reflex of a 
spinal cat. Lehner (1941) demon- 
strated that the duration of dishabitua- 
tion of the flexion reflex in the spinal 
rat habituated. 

In conditioning studies of the acute 
spinal dog, Shurrager and Culler 
(1941) and Shurrager and Shurrager 
(1941) demonstrated habituation of 
the leg flexion reflex to tail stimula- 
tion. More recently, Nesmeianova 
(1957) has demonstrated habituation, 
spontaneous recovery, and dishabitua- 
tion of the scratch reflex and the 
flexion reflex in chronic spinal dogs. 
Habituation below zero was marked; 
the response sometimes required days 
to recover after long periods 


or weeks 
of training. Kozak, McFarlane, and 
Westerman (1962) have reported sim- 


ilarly long-lasting habituation of flexion 


and scratch reflexes in the chronic 
spinal kitten. Habituation of the 


scratch reflex has also been reported 
for the chronic spinal frog (Afelt, 
1963). Lethlean (1965) has presented 
evidence suggesting that the flexion re- 
flex may continue to habituate for many 
minutes after the stimulus has been 
withdrawn in the chronic spinal rat. 
Very recent experiments by Buchwald, 
Halas, and Schramm (1964, 1965) re- 
ported habituation of single-unit dis- 
charges of motor neurons recorded 
from ventral root fibers in the acute 
spinal cat, and a preliminary summary 
of our own work has already appeared 
(Spencer, Thompson, & Neilson, 1964). 


REFLEX HABITUATION IN THE 
ACUTE SPINAL CAT 


Studies reviewed above indicated that 
habituation of the flexion reflex in the acute 
mammalian spinal preparation appears to 
exhibit some of the parametric relations 
characteristic of behavioral response habitu- 
ation in the intact organism. Since a good 
deal is known about the synaptic organiza- 
tion of the spinal cord, this preparation was 
chosen as a model system in which to study 
detailed characteristics of habituation and 
the possible neuronal mechanisms involved. 

The preparation used in these experiments 
was the acute unanesthetized decerebrate cat 
with spinal transection at T-12. Procedures 
are described in detail in papers (in prepara- 
tion) devoted to analyses of neuronal mech- 
anisms. Data given here are all in terms of 
isometric “twitch” contraction strength of 
the tibialis anterior muscle, a hindlimb flexor 
participating in the generalized limb flexion 
reflex to afferent stimulation. Other muscles 
were also studied to insure generality of 
observations. The flexion reflex is poly- 
synaptic, at least one interneuron intervening 
between the incoming afferent fibers and 
the output motor neurons. Dishabituating 
stimuli were either electrical or natural, that 
is, pinch. The basic phenomena of habitu- 
ation and spontaneous recovery were ob- 
served on all of the more than 100 cats 
studied. Spontaneous recovery was used as 
the criterion that habituation had occurred 
rather than response decrement due to de- 
terioration Or various artifacts. Although 
not shown in many of the figures, spon- 
taneous recovery was always obtained fol- 
lowing every habituation series, 
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Fic. 1. A: Habituation ( 
128 minutes) of the hindlimb 
shocks. (Stimuli were brief trains of 
onds during habituation and every 3 : 
minute period of no stimuli at about 
response measured is tension develope erior muscle, ex- 
pressed as a percentage of mean initial rol response amplitude, B: ffect of repeated- 
habituation and spontancous-recovery series on degree of habituation, Res 
to control level following first habituati i rehabituate 
Conditions as in A. Data are averages of 10-trial blocks. C: Effect of st 
on habituation. Single shocks give: in one habituation series and 1/second 
in the other to the saphenous nerv averages of 10-tria] blocks, p: Effect of 
stimulus intensity on habituation. of shocks, as in A, Were Pu. ER 
10 seconds to the saphenous nerve neous recovery allowed after each series. 
Voltages refer to output of stimulator and were attenuated, but in the game ratios, when 
delivered to the nerve. Data averaged over 3-trial blocks, > Stimulus generalization of 
habituation. Single shocks to two separate branches of the saphenous nerve, The habituating 
stimulus to one branch was given 1/second, and the test stimulus to the other branch was 
given l/minute. Data are averages over 3-trial blocks for response to the test stimulus 
and averages over the same per 
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In all experiments, initial-control response 
amplitudes were first determined by test 
stimuli delivered no more frequently than 
l/minute. Subsequent response amplitudes 
arc plotted in terms of percentage of mean 
control level.  Habituating stimuli were 
either single shocks (.01-.03-millisecond du- 
ration) or brief trains of five such shocks 
delivered over a 50-millisecond period. Al- 
though trains generally gave somewhat more 
rapid and less variable habituation, the para- 
metric relations of all other variables were 
comparable for both. Stimuli were delivered 
to skin or to afferent nerves. 

Characteristic habituation and spontaneous 
recovery of the spinal flexion reflex are 
shown in Figure 1A. Brief trains of shocks 
were delivered 1/10 seconds to skin during 
habituation (0 minutes to arrow) and 1/3 
minute during recovery. Individual re- 
sponses are given to illustrate normal vari- 
ability. Note the exponential form of ha- 
bituation and the long and variable time 
course of spontaneous recovery. Although 


the time courses of habituation and spon- 


taneous recovery are dependent on the many 
40 separate series 


variables described below, 
involving a variety of conditions were se- 
lected at random from our experiments to 
determine average values. The mean habitu- 
ation time Was 94 minutes (range 1.2-33 
minutes), and the mean recovery time was 
257 minutes (range .5 minute-108 minutes). 

The effect of repeated habituation-spon- 
taneous recovery series on habituation is il- 
lustrated in Figure 1B. Habituation and 
recovery conditions are the same as in 1A, 
and the data shown are averages for 10-trial 
blocks. The second series exhibits signifi- 
cantly more habituation than the first (^ 
«01; sign test). 

Figure 1C shows the effec 
Here single shocks were de- 


frequency. 
livered either at 1/second or 1/3.2 seconds to 


the saphenous nerve. Data are averages 
over 16-second blocks of time. Habituation 
is significantly greater with the more rapid 
stimulus (p <.01; sign test). 

The effect of stimulus intensity on habitu- 
ation is indicated in Figure 1D. Stimuli 
were brief trains of shocks to the saphenous 
nerve every 10 seconds at the indicated 
voltages. Data are averages of 3-trial 
blocks. There is no habituation for the 45- 
volt stimulus, significant habituation for the 
25.volt stimulus, and significantly greater 
habituation for the 7-volt stimulus (p <01 
for each comparison ; sign test). Thus rate 


t of stimulus 


t2 
un 


and degree of habituation are inversely re- 
lated to stimulus strength. 

The development of stimulus generaliza- 
tion of habituation is shown in Figure 1E. 
Single shocks were delivered to two separate 
branches of the saphenous nerve. The ha- 
bituating stimulus was given l/second, and 
the generalization test stimulus to the other 
nerve branch was given l/minute. Data 
are averages over 3-trial segments at com- 
parable times for the habituating stimulus. 
Response to the training stimulus habituated 
to 10% of control level, and the test-stim- 
ulus response decreased to about 60% of 
control level. Amplitudes of the last 10 
habituated, generalized, and control responses 
all differed significantly from each other 
(habituated versus control, ? — 18.1, df — 18, 
$ <.01; generalized versus control, ? = 2.43, 
df =18, p <.05; habituated versus general- 
ized, t= 7.58, df — 18, p <.01). Thus there 
is significant but not complete generalization 
of habituation from one input channel to an- 


‘other. 


The effect of “below-zero” habituation on 
spontaneous recovery is shown in Figure 2A. 
Stimuli were brief trains to the sural nerve 
every 5 seconds. In the recovery curves fol- 
lowing habituation to “zero,” recovery began 
when the response had reached a stable 
level. For the recovery curves following 
habituation below zero, habituation stimuli 
were continued for 10-20 minutes after the 
response had reached a stable level. As seen, 
spontaneous recovery has a longer time 
course following below-zero habituation. 
The habituated levels (ie. zero recovery 
time) are comparable for both. The two 
conditions were alternated 22 times in two 
separate experiments. The mean spon- 
taneous recovery time following habituation 
to zero was 2.1 minutes, significantly less 
than the 7-minute mean recovery time fol- 
lowing below-zero habituation (t = 445, df = 
20, p <.01). 

Two examples of dishabituation are shown 
in Figure 2B. In both cases the response was 
first habituated to single-shock stimulation 
of the tibial nerve at 1/3.2 seconds. Weak 
dishabituation was achieved by increasing the 
stimulus frequency to 3/second and strong 
dishabituation by a comparable increase in 
frequency plus an intensity increase. Ha- 
bituating stimuli were continued in each 
series. Data are averages of 8-trial blocks. 
The weak dishabituating stimulus yielded in- 
complete recovery, and the strong stimulus 
resulted in a response of greater than con- 
trol amplitude. In both cases the response 
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Fic. 2. A: Effect of below-zero habituation on the time course of spontaneous recover 
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returned to habituated levels in about 4 
minutes. 

Figures 2C and D indicate habituation of 
dishabituation. Here four successive dis- 
habituating stimuli were given with recovery 
allowed after cach. The habituating stimulus 
was a brief train of shocks to skin every 10 
seconds. Dishabituation was “natural” in 
this experiment—strong pinching of the 
digits. There is a reduction both in duration 
of dishabituation and in amplitude of dis- 
habituated responses following repeated 
dishabituation. 


The data given above illustrate all 
nine parametric relations characteristic 
of response habituation in the intact 
animal. Consequently the flexion re- 
flex of the acute spinal cat exhibits 
habituation as genuine as that for any 
response of the intact animal. The 
time relations for habituation and spon- 
taneous recovery here are within the 
range of those for responses of the in- 
tact animal. It is perhaps somewhat 
remarkable that a weak stimulus de- 
livered every few seconds for a few 
minutes can produce a relatively long- 
lasting but reversible alteration in the 
behavior of this very much “simplified” 


nervous system. 


Independence of Habituation and 
Dishabituation 


As noted earlier, dishabituation by ex- 
trancous stimulation is customarily believed 
due to disruption of the habituation process, 
whatever the latter may be. If this were the 
case, dishabituation ought not to increase the 
habituated response above its initial control 
level. However in Figure 2B a strong dis- 
habituatory stimulus was scen to increase the 
response well above the control level. This 
observation suggests an alternative hypothe- 
sis, namely that dishabituation is a separate 
facilitatory process superimposed upon the 
habituated system and probably many other 
response systems as well Several testable 
deductions may be made from this hy- 
pothesis. First, nonhabituated responses 
ought to exhibit dishabituation as well as 
habituated responses. Figure 3A shows this 
to be the case: the nonhabituated control- 
level response is increased markedly by a 


dishabituatory stimulus. 


3 3 A; Effect of a strong dishabitu- 
ating stimulus on an unhabituated (i.e., con- 


Fic. 3. 


trol-level) response. (Test stimulus was a 
brief train of shocks to skin every 2 minutes, 
and dishabituation was a strong shock train 
delivered elsewhere on the limb for 15 sec- 
onds. B: Independence of habituation and 
dishabituation. Habituation stimulus, arrow 
to bar, was brief skin-shock train given 1/ 
second and test stimulus given 1/minute. 
Following the strong, pinch, dishabituatory 
stimulus, bar, the habituating stimulus was 
discontinued and a test stimulus delivered 
1/minute. Note that the response dishabitu- 
ates, then rehabituates and spontaneously re- 
covers, even though there is no further 
habituation training.) 


A more crucial deduction concerns the role 
of the habituating stimulus in the decay of 
the dishabituation effect. Tf dishabituation is 
purely a disruption of habituation, a response 
which is dishabituated following habituation 
ought not to rehabituate unless additional 
habituation stimuli are given. Thus in Fig- 
ure 2B if the habituating stimulus was 
turned off after dishabituation, the response 
should remain at the high dishabituated or 
control level. Alternatively, if dishabituation 
is a superimposed facilitatory process having 
an intrinsic decay time, the dishabituated re- 
sponse should gradually decrease toward the 
habituated level, even if no habituation train- 
ing is given subsequent to dishabituation. 
This cruical test of the two alternative hy- 
potheses is illustrated in Figure 3B. After 
habituation to brief trains at 1/second (ar- 
row to bar), the response is dishabituated 
(bar), and the habituating stimulus with- 
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held and introduced only 1/minute to fee 
response amplitude. As seen, the A ges 
amplitude declines (or rehabituates ) cs 
most to the habituated level following d $ 
habituation, even though it is 2m longer 
being habituated. Subsequently it recovers 
spontaneously to the control level. This 
finding has been consistently obtained in 
more than 20 separate experiments. To 
demonstrate it clearly, conditions must be 
such that spontaneous recovery has a long 
time course relative to dishabituation, other- 
wise the habituated response will partialiy 
recover spontaneously by the time dishabitu- 
ation effects are dissipated. Since dishabitu- 
ation is not a disruption of habitvation, but 
rather a separate process having all the 
characteristics of response sensitization ( Hil- 
gard & Marquis, 1940), it will hereafter be 
termed sensitization. 

Although the independence of habituation 
and sensitization has not previously been 
demonstrated as such, several investigators 
have suggested the possibility. Humphrey 
(1933), in his studies of habituation in the 
snail and turtle, noted that a strong dis- 
habituatory stimulus caused a response of 
greater amplitude than prehabituated control 
responses and would also increase control- 
level responses prior to habituation. His 
comment deserves quoting: “These results 
at once suggest that what has been 
termed ‘dishabituation’ is simply an increase 
in excitability, which. under conditions not 
known, may summate with repeated stimu- 


lation [p. 143]." Sharpless and Jasper 
(1956) suggested that dishabituation of 
habituated EEG arousal was probably a 


transient superimposed facilitation. Pavlov 
(1927), it should be noted, used the same 
paradigm we employed in Figure 3B to 
demonstrate that the disinhibitory effect of 
an extraneous stimulus on a previously ex- 
tinguished CR was also transient and had the 
same decay time whether or not the un- 
reinforced conditioned stimulus (CS) 


was 
repeatedly applied. 


Sensitization thus emerges as a sep- 
arate and significant process in its own 
right. A strong stimulus of very short 
duration can cause an increase in the 
excitability of the spinal cord lasting 
for many minutes. Since sensitization 
is independent of habituation, at least 
for the spinal flexion reflex, parametric 
relations Numbers 8 and 9 are not 
really relevant to habituation as such, 
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but rather describe a separate process. 
In maintaining the comparability of 
habituation characteristics for re- 
sponses of the spinal cord and the in- 
tact animal, we are forced to the 
prediction that the independence of 
habituation and sensitization will also 
hold for responses of the intact animal. 
Future research will determine the ex- 
tent to which this is true. The habit- 
uation of sensitization (characteristic 
Number 9), incidentally, now becomes 
simply an additional example of habit- 
uation, the response being increased 
excitability of spinal reflexes by the 
sensitizing stimulus. g 


Neuronal Mechanisms of Habituation 
and Sensitization 


Analytic studies of possible neuronal 
mechanisms involved in habituation 
and sensitization of the spinal flexion 
reflex are presented elsewhere (Spen- 
cer et al., 1964; also papers in prepara- 
tion). Results of these experiments 
will be summarized briefly here. Re- 
ceptor adaptation we 


r [ 1 was ruled out by 
clectrical stimulation of afferent nerves 


with monitored neurograms indicating 
no change in input volleys, Effector 
fatigue was eliminated as a significant 
variable by recording activity from ef- 
ferent nerves and ventral roots rather 
than muscles. The possible role of 
feedback from muscle activity was 
eliminated by nerve and root Section 
and by drug paralysis of neuromuscular 
junctions, including those between 
gamma motor neurons and intrafusal 
muscle fibers. These results indicate 


that habituation and sensitization occur 


within the spinal cord. Tn numerous 


experiments testing possible habitua- 
tion of the monosynaptic reflex we 
have found no evidence of habituation 
in the sense of progressive decrement 
m response having a time course com- 
parable to flexion reflex habituation. 
The excitability of motor neurons in- 
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volved in a habituated response can be 
tested by interposing monosynaptic vol- 
leys activating. them. In addition, 
using both ventral-root electrotonus as 
a population measure, and intracellular 
microelectrode recording of identified 
motor-neuron membrane potentials for 
behavior of neuronal elements, the 
subliminal responses of motor neurons 
during habituation and dishabituation 
were also studied. There was no back- 
ground alteration in the monosynaptic 
reflex excitability of motor neurons 
during habituation, but there was a 
clear-cut decrease in motor-neuron 
polysynaptic potentials. Thus what- 
ever processes are responsible for 
habituation occur prior to motor 
neurons (ie. in interneuron path- 
ways) and cannot be due to inhibition 
acting on the motor neurons. During 
sensitization there was often, although 
not invariably, an increase in the 
excitability of motor neurons. 'These 
observations further substantiate the 
behavioral demonstration of the inde- 


pendence of habituation and sensitiza- 
tion. 

The possibility that habituation is 
due solely to altered conduction in af- 
ferent terminals of nerve fibers stimu- 
lated during training is ruled out by 
the generalization experiment of Fig- 
ure 1E. If habituation were due en- 
tirely to changes in afferent terminals 
of the habituated channel, there would 
be no generalization of habituation to 
the other channel since completely sep- 
arate input channels were used. Gen- 
eralization of habituation did occur, so 
the decrement must have involved cen- 
tral processes shared by both inputs. 

To summarize, it has been proven 
bevond reasonable doubt that habitua- 
tion of the spinal flexion reflex is not 
dependent upon receptor adaptation or 
effector fatigue and occurs after the 
afferent input terminals but prior to 
the output motor neurons in the spinal 


cord. Consequently the process of 
habituation must occur in the course 
of transmission through interneurons. 
Prosser and Hunter (1936), incident- 
ally, suggested some time ago that this 
was probably the case. Sensitization, 
on the other hand, is reflected by in- 
creased excitability of both motor and 
other neurons. Hypotheses and fur- 
ther evidence concerning the processes 
responsible for habituation and sensiti- 
zation are developed below. 


THEORETICAL IssuES—BEHAVIORAL 


Extinction 

The similarity of habituation and 
experimental extinction was perhaps 
first emphasized by Humphrey (1930), 
although Pavlov (1927) had earlier in- 
sisted on the basic comparability of un- 
conditioned and conditioned responses. 
Humphrey (1930, 1933) noted that 
close parallels. exist between the two 
processes in terms of spontaneous re- 
covery, below-zero effect, and dishabit- 
uation.  Konorski (1948), in discus- 
sing the general properties of plasticity, 
notes several other parallels including 
the effect of stimulus frequency and 
potentiation of habituation upon Te- 
peated testing. 

The experimental literature on ex- 
tinction of conditioned responses ac- 
tually contains. many illustrations of 
the nine parametric relations listed 
above for habituation (cf. Brogden, 
1951: Hull, 1943: Kimble, 1961; Pav- 
lov, 1927). In fact many of these are 
among the commonly cited characteris- 
tics of experimental extinction. To as- 
sert that habituation is really extinction 
does not of course constitute any kind 
of explanation for either process. By 
definition extinction presupposes prior 
conditioning, and the variables of train- 
ing have a potent influence on the 
characteristics of extinction. Although 
unspecified prior conditions of learning 
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« » 
may modify or shape many reflex" ac- 
tivities of the intact organism, habitua- 
tion does imply an unlearned response. 
However the fact that habituation and 
extinction appear to show similar para- 
metric features does raise the possibil- 
ity that they may share common under- 
lying mechanisms. Although many 
different pieces of the nervous system 
are clearly involved in extinction and 
habituation of various responses, we 
would like to suggest (in agreement 
with Sharpless, 1964) that common 
neuronal mechanisms exist. Response 
decrements would then be due to a gen- 
eral property of neuronal activity com- 


mon to all such systems. At least 
partial analysis of such neuronal 
mechanisms seems possible in the 


spinal animal. Hypothetical neural 
mechanisms consistent with current 
knowledge of neurophysiology are de- 
veloped below. 


Stimulus Satiation and Related 
Constructs 


A variety of complex behavioral re- 
sponses such as spontaneous alterna- 
tion, exploration, and curiosity have 
been subsumed under the construct of 
stimulus satiation (Glanzer, 1953). 
Glanzer's ingenious theory begins with 
a five-part postulate describing the 
properties of stimulus satiation and de- 
duces a number of testable predictions 
about maze running and exploratory 
behavior. The majority of such pre- 
dictions have been verified, particularly 
where deductions from Hull's postu- 
lated reactive inhibition lead to oppo- 
site predictions (Dember, 1961; Glan- 
zer, 1953). Glanzer's postulate can be 
viewed as a formalized restatement of 
some of the parametric characteristics 
of response habituation listed above. 
All of Glanzer’s deductions may also 
be made from the earlier established 
properties of response habituation, 
His general postulate reads: “Each 
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moment an organism perceives a stim- 
ulus object or stimulus objects, A, 
there develops a quantity of stimulus 
satiation to A [p. 259].” We may 
rephrase it as: Each moment an or- 
ganism responds to a stimulus object 
or stimulus objects, A, there develops a 
quantity of response habituation to A. 
Glanzer's deductions follow equally 
well from this restatement of stimulus 
satiation in terms of response habitua- 
tion. 

Glanzer's approach has been cate- 
gorized as a peripheral rather than a 
central theory (Dember, 1961 ; Walker, 
1958). The above review of neuro- 
physiological studies indicated that 
there is as yet no entirely satisfactory 
or consistent evidence for habituation 
or decrement of evoked responses in 
primary sensory systems at any level 
from receptor to cortex. In conse- 
quence it would seem misleading to 
label as stimulus satiation a decre- 
mental process that does not appear to 
exist at any level of neural activily 
which can be identified with the im- 
mediate responses to Sensory activa- 
tion. Insofar as behavioral responses 
are concerned, the formal properties of 
stimulus Satiation and response habit- 
uation are the same, 


Walker (1958) Suggested that the 
term "action decrement,” 
tral locus, be substituted for concepts 
like stimulus satiation or reactive inhi- 
bition. Although he does not relate it 
to the literature on behavioral habitua- 
tion, its properties appear to be the 
same. Dember (1956, 1961) and 
Dember and Earle (1957 ) have devel- 
oped still another theoretical frame- 
work for this Category of behavior 
which they term “stimulus change.” 
The organism seeks to obtain the 
“ideal” amount of “stimulus complex- 
ity.” Thus in an alternation experi- 
ment, the animal alternates choices not 
hecause he is satiated or habituated to 


having cen- 
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one stimulus, but because the other at- 
tracts him more (Dember, 1961). 
Dember (1956) reported an experi- 
ment designed as a critical test of stim- 
ulus-satiation versus stimulus-change 
hypotheses. Rats were given experi- 
ence in a T maze (one arm white, one 
black) and then tested with both arms 
the same, He argued that satiation 
theory would predict no difference in 
test-choice behavior, but stimulus 
change would predict more choices of 
arm different in experience and test 
sessions. Dember's results verified his 
prediction. This argument would seem 
justified if stimulus satiation is limited 
only to alley brightness. However, if 
the total response pattern of running, 
orienting, turning, and response to 
brightness is considered, then generali- 
zation of response habituation will be 
greater to arm same than arm different. 
Thus analysis in terms of response 
habituation would also predict more 
choices of arm different than arm same. 
As yet there seems to be no clear way 
of differentiating experimentally be- 
tween the alternative hypotheses of 
“avoiding” or habituating to one stim- 
ulus complex versus “approaching” 
another stimulus complex. It has been 
suggested (Brown, 1961) that this dis- 
tinction may be fundamentally untest- 
able. Again, the formal behavioral 
properties of Dember's and Earle s con- 
structs do not appear to differ greatly 
from those listed above. 

To summarize, the formal behavioral 
properties of stimulus satiation, action 
decrement, and search for ideal stim- 
ulus complexity appear similar to the 
properties of response habituation. 
They may well be indistinguishable at 
the behavioral level. The earlier and 
more neutral term, habituation, viewed 
as response decrement, might better 
serve the search for central neural cor- 


relates. 


Reactive Inhibition 


The construct of reactive inhibition 
(/,), developed by Hull (1943) from 
Pavlov’s (1927) more general concept 
of “inhibitory state,” has been shown to 
have considerable predictive utility, 
particularly in experiments relating to 
work performance (Hull, 1943; Solo- 
mon, 1948). It is unquestionably the 
most carefully and extensively analyzed 
construct dealing with response decre- 
ment. 

The parametric characteristics of 
habituation listed above are consistent 
with deductions from reactive inhibi- 
tion, except for Number 5, indicating 
that amount of habituation is inversely 
related to stimulus strength. At first 
glance this may seem contradictory, in 
the sense that response strength is often 
related to stimulus strength. How- 
ever, I, has been developed within the 
framework of performance variables, 
where the stimulus events cannot easily 
be specified, and no close relation be- 
tween stimulus strength and response 
amplitude holds. In habituation, par- 
ticularly of spinal reflexes, initial re- 
sponse strength is directly related to 
stimulus strength. The increased ac- 
tivation of the response by a stronger 
stimulus could overcome the J, buildup. 
Sherrington's analysis of reflex fatigue 
(see below) elaborates this point. 

Glanzer (1953) showed that deduc- 
tions from J, in T-maze alternation be- 
havior appeared to lead to the em- 
pirically contracted prediction that sub- 
jects will alternate turns even if the 
discriminable stimuli on the two sides 
are switched each trial. This has been 
considered a telling argument against 
I, (Dember, 1961; Glanzer, 1953). It 
seems to us that it may be a rather 
artificial contradiction resulting from 
the definition of response which was 
chosen. From the actual response 
sequence of running down the alley, 
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orienting toward and approaching one 
of the stimulus complexes, and hence 
turning, Glanzer abstracts turning as 
the response. If we instead emphasize 
stimulus orientation and approach as 
the more relevant aspect of the re- 
sponse sequence, then application of 
Hull's J, analysis leads to exactly the 
same predictions as does Glanzer’s 
satiation hypothesis. This characteri- 
zation of the response, incidentally, is 
comparable to that used by Spence 
(1960, p. 374) in his analysis of spatial 
discrimination learning. 

To summarize, 7, is not inconsistent 
with the phenomena of habituation, and 
may even be viewed as a molar analysis 
of the role of response habituation in 
behavior. Experimentally contradicted 
predictions made from 7, about alterna- 
tion behavior may be ascribed to the 
particular response definition used. 
The hypothetical neuronal model of 
habituation developed below might well 
be considered as a possible neurophysi- 
ological basis for certain aspects of J,. 


Fatigue 


There is no agreed upon definition of 
fatigue, either in psychology or physi- 
ology. We have defined effector fa- 
tigue as response decrement localized 
to neuroeffector and effector processes 
(cf. Merton, 1954) and shown that it 
cannot account for habituation, at least 
of the flexion reflex. It is extremely 
unlikely that effector fatigue has any 
great importance for habituation gen- 
erally since all responses that show 
habituation can he elicited more 
strongly, more rapidly, and without 
decrement under other conditions. As 
far as central fatigue is concerned, ar- 
bitrary definitions specifying certain 
synaptic states could also be used, but 
have not come into general parlance. 
Sherrington’s (1906, p. 220) molar 
definition of reflex fatigue is perhaps the 
most commonly used: “. a spinal 


reflex under continuous excitation or 
frequent repetition becomes weaker, 
and may cease altogether." He was 
very careful to hold firmly to this 
purely operational definition, which is 
of course identical to the definition of 
habituation given above. In like man- 
ner, Hull (1943, p. 278) identifies re- 
active inhibition with fatigue, the latter 
being defined as decrement in 
action potentiality rather 
than an exhaustion of the energy... 
[italics ours]." 

The inverse relationship between 
stimulus strength and rate and degree 
of fatigue or habituation was analyzed 
very carefully by Sherrington (1906, 
p. 220): 


evocation 


In my experience, these spinal reflexes fade 
out sooner under a weak stimulus than un- 
der a strong one. This seeming paradox in- 
dicated that even under feeble intensities of 
stimulation the threshold 
gradually rises, and that it 
threshold value of the w 
fore it re; 


of the reaction 
rises above the 
caker stimulus be- 
aches that of the stronger stimulus. 

Many students of habituation have 
argued against fatigue as the mechan- 
ism. Most of these have implied effec- 
tor fatigue (e.g., Harris, 1943; Prosser 
& Hunter, 1936). However the more 
general concept of fatigue has also been 
questioned on the grounds that dis- 
habituation could not occur if a simple 
fatigue process were responsible for 
decrement. Such arguments of course 
make an implicit assumption about the 
nature of central fatigue; namely, that 
it is a process which cannot be broken 
up rapidly by dishabituation. However 
when dishabituation is viewed as an 
independent superimposed process 
(ie, sensitization), the argument loses 
its force. Thus in the flexion reflex, 
response decrement occurs in inter- 
neuronal pathways without any back- 
ground change in motor-neuron excit- 
ability, but sensitization is commonly 
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reflected in transient depolarization of 
motor neurons as well. 

In summary, operational definitions 
of central fatigue do not differ from 
definitions of habituation and reactive 
inhibition. Even if this central process 
is characterized as having a slow re- 
covery time, dishabituation is not a 
contradiction if viewed as an independ- 
ent superimposed sensitization process. 


THEORETICAL IssuES—NEURAL 


Stimulus Model Comparison 


On the basis of a wide variety of 
ingenious experiments concerning ha- 
bituation of the human orienting reflex, 
Sokolov (1960) has developed a theo- 
retical neural schema. While this 
theory is somewhat specific to the 
orienting response, it has occasioned 
widespread interest (cf. Magoun, 1963; 
Pribam, 1963) and can be generalized 
to other types of response habituation. 
In essence it is a feedback comparator 
analogue. One system (cerebral cor- 
tex) is concerned with the formation 
of a model of the stimulus-input pat- 
This interacts with an amplify- 
lar formation) which 
ponse tendency. As 
the model develops jn the cortex, retic- 
ular amplification is inhibited. If a 
dishabituating stimulus is given, the 
amplification is no longer inhibited, and 
an increased response Occurs. The 
essential point in the theory 1s that a 
model of the habituating stimulus ac- 
tivity develops, against which any other 
stimulus pattern is compared. _Re- 
sponse strength is directly proportional 
to the degree of discordance between 
any stimulus input and the model. 

ial experimental observa- 


The cruc! 
tions forming the basis of this theory 
concern dishabituation by altered stim- 


ulus conditions where intensity is not 
increased. 


Thus after habituation to 
a given tone, à we 


tern. c 
ing system (reticu 
serves to alter res 


aker tone or a shorter 


4 


tone produces dishabituation* of the 
orienting reflex (Sokolov, 1960). 
Hence a comparison model is neces- 
sary. There are few direct tests of 
Sokolov’s theory. Habituation of the 
orienting reflex does occur, albeit more 
slowly, in the totally decorticated ani- 
mal (Lebidinskaia & Rosenthal, 1935), 
and auditory orientation habituation 
occurs in animals without auditory cor- 
tex (Thompson & Welker, 1963). 
These observations tend to raise diffi- 
culties for the role of the cortex as the 
model formation system, but the gen- 
eral construct could still be used by 
invoking primary sensory relays below 
the thalamocortical level for the model 
comparater. In like manner it could 
be generalized to spinal reflexes by as- 
suming a dual mechanism (as yet un- 
identified) in cord internuncials. 
Perhaps the most important point 
about the theory is that it may not be 
necessary to assume a model-compari- 
son process to account for dishabitua- 
tion by reduced stimulus intensity or 
altered stimulus characteristic. Cod- 
ing of intensity in neural networks is 
complex, and there is not always a 
simple relation between stimulus in- 
tensity and neural activity. Thus in 
the inferior colliculus and auditory cor- 
tex, for any given frequency, some 
neurons fire to weak tones but not to 
strong tones (Hind, Rose, Davies, 
Woolsey, Benjamin, Welker, & Thomp- 
son, 1960; Rose, Greenwood, Goldberg, 
& Hind, 1963). Hence a weak tone 
will activate cells and synapses not 
activated by the stronger tone. If 
habituation is simply some sort of de- 
crease in synaptic efficacy, then a larger 
response would be expected to a weak 
tone after habituation to a strong tone 
because different neurons are activated. 


5 Tt should be noted that this is a some- 
what special use of the term dishabituation 
in that the extra stimulus is subsequently 


used as the test stimulus. 
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Consequently no sort of comparison 
against a model is necessary; the pro- 
cess can equally be construed as 
straightforward decreases in the effi- 
cacy of certain pathways, together with 
differing degrees of pathway overlap 
resulting from differing stimulus con- 
ditions.  Dishabituation by change to 
a weaker or otherwise altered stimulus 
then becomes an instance of incomplete 
stimulus generalization of habituation. 
A stimulus having somewhat different 
central connections than the habituat- 
ing stimulus yields a larger net re- 
sponse because generalization of habit- 
uation to the new stimulus is only 
partial. 


Afferent Neuronal Inhibition 


This theory has been elaborated by 
Hernández-Peón (1955) and has re- 
ceived considerable attention (e.g., Liv- 
ingston, 1959; Worden & Livingston, 
1961). In essence it is assumed that 
centrifugal fibers from the reticular 
formation and related structures cours- 
ing to various peripheral sensory re- 
ceptors or nuclei act to inhibit respon- 
siveness at the periphery, thus reduc- 
ing activity in sensory systems. This 
inhibition is said to develop during 
repetitive stimulation and results in 
evoked response habituation. 

There are sound neurophysiological 
reasons for believing that some types of 
efferent control systems do exist, al- 
though they do not always involve the 
reticular formation. Two such sys- 
tems have been described for the audi- 
tory system, one acting on the cochlea 
(Galambos, 1965; Rasmussen, 1946) 
and one on the cochlear nucleus (Des- 
medt, 1960; Desmedt & Michelse, 
1958). However neither is coexten- 

sive with the reticular formation. For 
the optic system, Granit (1955) has 
demonstrated both excitatory and in- 
hibitory effects on retinal ganglion cells 
by stimulation of the midbrain tegmen- 
tum. In the somatic system, Hag- 
barth and Kerr (1954) showed that 


reticular stimulation could yield both 
facilitatory and inhibitory effects on the 
second-order ascending sensory neu- 
rons in the spinal cord. Kerr and 
Hagbarth (1955) found diminution of 
olfactory-bulb activity when stimulat- 
ing a variety of nonreticular structures 
Such as prepyriform cortex, amygda- 
loid nucleus, and olfactory tubercle. 

In sum, efferent control Systems ap- 
pear to exist for all sensory modalities 
studied. Control is sometimes exerted 
on early central relays rather than on 
peripheral receptors and does not al- 
ways involve the reticular formation. 
These systems could well provide the 
basis for habituation of sensory evoked 
primary responses. Unfortunately, as 
yet there seems to be no entirely satis- 
factory or consistent evidence for the 
occurrence of such habituation (except 
for rhythmic activity of the olfactory 
bulb, Moulton, 1963). The further 
possibility that EEG arousal habitua- 
tion is so mediated is negated by 
studies showing comparable habitua- 
tion phenomena for EEG arousal in- 
duced by central electrical reticular 
stimulation (Glickman & Feldman, 
1961) where peripheral gating could 
not have any role, and by 
and Jasper's (1956) finding that pri- 
mary evoked responses did not habitu- 


ate during habituation of EEG arousal 
to auditory stimuli. 


Sharpless 


Alterations in Synaptic Transmission 


In the course of this article it has 
been suggested at several points that 
common neuronal mechanisms may ac- 
count for many varieties of response 
habituation. Sherrington (1898, 
1906), Prosser and Hunter (1936), 
Sharpless and Jasper (1956), and 
Kennedy and Mellon (1964), among 
others, have all raised the possibility 
that some type of decrease in synaptic 
transmission may form the basis of 
habituation. An hypothetical mechan- 
ism will be developed here for the 
spinal flexion reflex, and its possible 
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applicability to other systems consid- 
ered. 

It was proved above that habituation 
of the flexion reflex in the neurally iso- 
lated spinal cord must occur in the 
transmission through interneurons be- 
tween incoming afferent fibers and out- 
put motor neurons. At least three 
general classes of neural events may 
be postulated to yield decreased neu- 
ron response tendency : synaptic inhibi- 
tion, decreased synaptic efficacy, or 
membrane desensitization. 

Synaptic Inhibition. Within the 
category of synaptic inhibition, two 
general types have been described, 
postsynaptic inhibition (PostSI), re- 
flected in hyperpolarizing inhibitory 
postsynaptic membrane potentials, and 
presynaptic inhibition (PreSI), which 
acts by depolarizing presynaptic ter- 
minals to vield a smaller excitatory 
postsynaptic potential (Eccles, 1964). 
These inhibitory mechanisms have been 
developed for several types of neurons 
in the spinal cord and are now being 
established for certain neurons in other 
portions of the CNS (Eccles, 1964). 
Both types of inhibition could be 
ostulated to act on interneurons medi- 
ating the flexion reflex. However, the 
measured time courses of both these 
types of inhibition (Eccles, 1964) are 
much too short to summate with 1/ 
second stimulation and to account for 
effects lasting several minutes. It 
could be postulated that complex net- 
works involving tonic or cascading and 
cumulative PreSI or PostSI are acti- 
vated by the repeated-stimulus pattern. 
Time relations are still a problem since 
Gerard and Forbes (1928) were able 
to detect some homonymous depression 


p 


up to .8 second after only a single 
shock (to mixed nerves in their experi- 
have found a similar 


ment), and we b 
a a l-second interval. 


depression witl 
se of known types of 


The time course 
Pre- and PostSI is less than about 500 


milliseconds. Evidence from pharma- 
cological studies 1$ also against an 1n- 
gica 


(00000 EHE 


hibitory hypothesis. Strychnine has 
been shown to attenuate greatly all 
types of spinal-cord PostSI on which 
it has been tested (Eccles, 1964). Pic- 
rotoxin has been shown to partially 
block PreSI (Eccles, 1964). We 
have found that habituation of the 
flexion reflex is not significantly al- 
tered following administration of these 
two agents in doses sufficient to obtain 
the reductions in inhibition just noted. 
Finally, it may be mentioned that elec- 
trical indexes of phasic PreSI (such 
as the cord-dorsum P wave) and 
phasic PostSI (hyperpolarizing phases 
of intracellularly and electrotonically 
recorded motor-neuron responses) 
themselves show decrement upon re- 
peated activation of cutaneous afferents 
(Spencer et al., 1964). The N, com- 
ponent of the cord-dorsom response, 
which itself is highly susceptible to 
presynaptic inhibition, often shows no 
decrement during the development of 
habituation, thus providing further evi- 
dence against PreSI. To summarize, 
all existing evidence points away from 
inhibition as the agent underlying this 
type of habituation.* 

Decreased Synaptic Efficacy. In con- 
sidering alternative explanations, it is 
helpful to recall that there are at least 
two other well-known consequences of 
reflex action which are quite different 
from inhibition. The first is a species 
of synaptic depression variously termed 


“low frequency depression,” “defacili- 


tation," “homosynaptic depression,” or 
which has 


"subsynaptic depression" i 
heen studied in the monosynaptic reflex 
pathway (Curtis & Eccles, 1960; Jef- 
ferson & Schlapp, 1953; Lloyd & Wil- 
son, 1957) and also in some of the 
simplest disynaptic systems (Wilson, 


6 However it is to be noted that our ex- 
periments were carried out with single-shock 
stimuli at 1-3-second intervals and that in- 
hibition, particularly the presynaptic variety, 
may play an important role when higher fre- 
quencies are used, as they were au Sherring- 
ton’s (1899, 1906) studies on fatigue or when 
intermittent stimulus trains are employed. 
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1958). Several features of this depres- 
sion are noteworthy within the context 
of our discussion: (a) It is manifested 
as a decrease of PSP amplitude due to 
intermittent low-frequency synaptic ac- 
tivation and is either pre- or _Sub- 
synaptic rather than postsynaptic in 
origin; (b) its time course is too long 
to be accounted for by presynaptic in- 
hibition (Eccles, Kostyuk, & Schmidt, 
1962) ; and (c) it is detectable at those 
stimulus intervals required to produce 
the polysynaptic decrement we have 
described using single-shock stimuli. 
The degree of low-frequency depres- 
sion is greater with increased stimulus 
frequency, as is the case with habitu- 
ation. Although Jefferson and Schlapp 
(1953), recording ventral-root volleys, 
did report diminished depression with 
strong stimuli, which would further 
Parallel habituation, Curtis and Eccles 
(1960), recording motor-neuron exci- 
tatory postsynaptic potentials, did not 
find this effect even with stimuli supra- 
maximal for Group Ia fibers, The de- 
pression produced in monosynaptic 
pathways by low-frequency depression 
is briefer in duration (less than 20 
seconds) than that which we have de- 
scribed, However, it must be remem- 
bered that the time course of the de- 
pression might be modified in our 
experiments due to the fact that inter- 
neurons characteristically fire in brief 
trains of discharges, even to single pre- 
synaptic volleys, and that this would 
constitute a distinctly different test 
pattern which might have a more 
prolonged recovery period. 

The second relevant consequence of 
reflex action, “after discharge” 
(Forbes, 1922; Ranson & Hinsey, 
1930), is opposite in effect and char- 
acteristic of polysynaptic systems. 
This is manifested as a postsynaptic 
depolarization and prolonged discharge 
of inter and motor neurons (Fuortes, 
1954; Hunt & Kuno, 1959), is gen- 
erally more prolonged with stronger 
stimuli, and may build up with repeated 


stimuli (Frank & Fuortes, 1956). 
Preliminary evidence we have obtained 
from extracellular interneuron record- 
ings indicates that such afterdischarges 
are present and build up with stimulus 
repetition during the processes de- 
scribed, 

We would therefore suggest, as a 
working hypothesis, that the ampli- 
tude of repeated polysynaptic test re- 
sponses depends upon the balance be- 
tween low-frequency depression and 
afterdischarge existing at the time each 
stimulus is presented. Viewed in this 
way, the frequency sensitivity of the 
decrement we have described would be 
related to the well-known fre 
sensitivity of low-frequency depression. 
Stimulus generalization would depend 
upon depression of interneuron synap- 
ses shared by the two input channels. 
The greater response stability 


the strong stimulus stren 
flect 


quency 


seen at 
gths might re- 
a greater preponderance of after- 
discharge onto interneurons relaying 
the reflex. This would increase in du- 
ration with strong stimuli and would 
exist during the presentation of succes- 
sive stimuli, thereby facilitating the 
postsynaptic interneuron responses to 
them, in a sense compensating for the 
low-frequency depression. Moreover, 
if the phenomenon of sensitization (i.e., 
Tesponse restoration following a strong 
extrastimulus) be considered a special 
case of very prolonged afterdischarge 
due to a strong stimulus, then many of 
the puzzling features of this phenome- 
non could be more readily understood, 
Wilson’s (1955) postulated mechanism 
of polysynaptic posttetanic potentiation 
(PTP) is an alternative possibility for 
sensitization, as is the recently de- 
Scribed phenomenon of “presynaptic 
facilitation” (Kandel & Tauc. 1964; 
Mendell & Wall, 1965), since there are 
certain obvious similarities to the proc- 
ess of response restoration we have 
described. 

Membrane Desensitization. We 
noted earlier that some forms of re- 


& 


bi. 
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sponse habituation, even of spinal re- 
flexes, may last for days or weeks. In 
the experiment by Kozak et al. (1962), 
for example, a crossed scratch reflex 
of the chronic spinal kitten, elicited by 
rubbing an area of skin for 10 minutes 
daily, exhibited marked habituation, re- 
quiring a period of 8 days to recover. 
The synaptic processes discussed above, 
namely inhibition and decreased effi- 
cacy due to the balance of low-fre- 
quency depression and afterdischarge, 
would both seem to have too short a 
time course, per se, to account for 
such long-lasting habituation. Sharp- 
less (1964) has suggested that a type 
of membrane desensitization process, 
involving decreased responsiveness of 
the postsynaptic membranes of nerve 
cells due to repeated applications of the 
transmitter substances, may be involved 
in long-term habituation. In hyper- 
sensitized denervated smooth muscles 
and glands, for example, administration 
of appropriate excitatory agents such 
as epinephrine several times daily pro- 
duces a desensitization effect that may 
last for several days (Emmelin & 
Muren, 1952; Simeone, 1938). This 
type of hypothesis may be particularly 
appropriate to the chronic spinal prep- 
aration in that spinal reflexes become 
markedly exaggerated or hypersensi- 
tized, as do responses of dennervated 
smooth muscles and glands. Two dif- 
ferent classes or components of re- 
sponses have been distinguished on the 
basis of habituation and spontaneous- 
recovery time courses. Sokolov (1955) 
noted that the “orienting reflex” ex- 
hibited both rapidly habituating and 
slowly habituating components. Sharp- 
less and Jasper (1956) differentiated 
between à “phasic” EEG arousal that 
occurs immediately after a stimulus, is 
of short duration, and is relatively re- 
sistant to habituation and a “tonic” 
component that habituates much more 
rapidly. Lesions of the brachia of the 


inferior colliculus appeared to abolish 
the phasic component (to auditory 


stimuli) without eliminating the tonic 
component. Thompson and Welker 
(1963) found that ablation of auditory 
cortex prevented the development of 
long-term intersession habituation of 
orienting reflexes to sounds, but did not 
alter short-term intrasession habitu- 
ation. 

It may well be necessary to postulate 
at least two different types of neuronal 
processes to account for short-term and 
long-term habituation. Our suggested 
mechanisms of low-frequency depres- 
sion and afterdischarge are perhaps ap- 
propriate to short-term effects and 
Sharpless’ “membrane desensitization” 
to long-term effects, Alternatively, all 
these processes may interact to a 
greater or lesser extent in all instances 
of habituation. The fact that, except 
for time course, both short-term and 
long-term habituation exhibit the same 
parametric features may imply some 
identity of underlying processes. 


CONCLUSION 


In summary, we would suggest that, 
at least for the spinal flexion reflex as 
we have studied it, the process of ha- 
bituation may represent the cumulative 
effect of a polysynaptic low-frequency 
depression and that dishabituation is a 
type of superimposed sensitization, pos- 
sibly resulting from facilitatory after- 
discharge. 

In amplifying this scheme, one might 
postulate, on the basis of other evidence 
(Frank & Fuortes, 1956), that rather 
long iterative trains of interneuron 
discharges are produced by each stimu- 
lus, and that the long duration of the 
changes noted depends on this feature 
of the test response. If one provision- 
ally adopts this point of view, several 
aspects of the literature on habituation 
reviewed here may be seen in somewhat 
different perspective. The absence or 
poor development of evoked response 
habituation in primary sensory sys- 
tems would be consistent with the fact 
that the interneurons exhibiting very 
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prolonged discharge patterns to single 
stimuli are not typical of the long re- 
lays from receptors to cortex in these 
systems. However such iterative firing 
is more characteristic of neurons in the 
system mediating EEG arousal; the 
reticular formation (Moruzzi, 1954), 
which does exhibit habituation (Bell, 
Sierra, Buendia, & Segundo, 1964; 
Sharpless & Jasper, 1956). In like 
manner, such interneurons are un- 
doubtedly present in all functional sys- 
tems of the intact organism partici- 
pating in behavioral response habitu- 
ation. 

Other things equal, complex re- 
sponses tend to habituate more rapidly 
than simpler responses. If degree of 
response complexity is reflected in in- 
creased numbers of interneurons inter- 
posed between stimulus and response, 
and in an increase in the duration of 
their repetitive discharges, these fac- 
tors themselves might account for more 
rapid habituation. The effects of CNS 
lesions on behavioral response habitu- 
ation may also be reconsidered in this 
way : Effective lesions remove many in- 
terneurons normally involved in com- 
plex responses and hence could lead to 
slower response habituation. 

Many of these extrapolations from 
data obtained in the study of response 
decrement in the spinal cord to habitu- 
ation as it appears in the more intact 
organism are obviously speculative, 
"They do have the virtue, however, of 
indicating the directions which cellu- 
lar theories of habituation could take. 
Quantitative theoretical models would 
clearly be of great help in making such 

cellular explanations useful alternatives 
to the more molar theories commonly 
applied to behavior of the intact or- 
ganism. 
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SIGNAL-DETECTABILITY THEORY OF RECOGNITION- 
MEMORY PERFORMANCE: 


THEODORE E. PARKS? 


University of Wisconsin 


Psychophysical signal-detection theory is applied to recognition-memory 
performance. Old items in the test list are considered to be analogous 
to "signals" while new items are analogous to “noise only." The re- 
sulting fundamental assumptions describe the covert responses which 
mediate recognition-memory performance as varying continuously in 
strength. Covert responses to both old and new items are normally 
distributed with the distance between distributions representing learning 
and retention. The overt response to an item depends on whether or 
not S's covert response exceeds an arbitrary criterion. The available 
evidence suggests that S's criterion will be set such that he will most 
probably choose a number of items approximately equal to the number 


of old items in the test. 


A test of recognition memory typ- 
ically involves presenting the subject 
with a list composed of old items (i.e., 
items that were in an inspection list 
presented previously) and new items. 
He is required to examine each item 
of the test list and to indicate whether 
or not it was a member of the inspec- 
tion list. Two measures of perform- 
ance are taken: (a) the proportion 
of the old items in the test list which 
the subject indicates are old and (b) 
the proportion of new items which the 
subject indicates (incorrectly) are old. 

Customarily, these two scores have 
been combined in a “correction for 
guessing" which was thought to yield 
a single "true" measure of memory. 
Underlying this correction is the 
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assumption that recognition memory 
is a sort of all-or-none process; that is, 
that correct recognitions consist of 
items recognized together with items 
chosen purely through random guess- 
ing. On the contrary, the present 
paper presents evidence that an item 
is not either recognized or not recog- 
nized, but, rather, the covert events 
which mediate overt recognition be- 
havior vary in strength along a 
continuum. 

Under this assumption, the subject 
is faced with a complex decision prob- 
lem. The present theory attempts to 
describe this problem and the sub- 
ject's solution in detail and in such a 
way as to make possible precise 
predictions concerning the effects of 
certain independent variables on rec- 
ognition-memory performance. Fol- 
lowing to some extent the suggestion 
of Egan (1958), many of the assump- 
tions employed are quite similar to 
those found by Swets, 
Birdsall (1961) 
signal detection 
situation.3 


Tanner, and 
to be adequate for 
in a psychophysical 


3 For those read 


ers who are famili i 
Swetsetal, miliar with 


(1961), the application of their 


é 
m 


RecoGNITION-MEMorY PERFORMANCE 45 


GENERAL THEORY 
' Dependent Variables 


In taking the recognition test, the 
subject may be required to choose a 
certain total number of test-list items 
(usually equal to the actual number of 
old items in the test), or he may be 
free to choose as many or few items as 
he desires. The present paper is con- 
cerned primarily with performance in 
the latter situation. Since the pro- 
cedure employed in these experiments 
does not require the subject to pick 
some particular number of items, the 
number of old items that he calls old 
does not necessarily covary with the 
number of new items that he calls 
old. These two scores, expressed as 
the proportion of each type of item 
that the subject calls old, are the pri- 
mary dependent variables of interest. 


Fundamental Mechanisms 


The present theory posits the 
existence of a psychological continuum 
according to which each decision to 
respond "old" or “new” is made. 
This dimension may be thought of as 
representing the degree of familiarity 
associated with test items and accord- 
ingly will be named the familiarity 
continuum (X). It represents that 
property of the covert response to the 
test stimuli which is effective in the 
determination of the subject’s overt 
report. 


e aoe memory may be made 
d ld iu UM test item to be a “trial” 

ems being “signal plus noise” trials 
and new items being “noise only” trials. 
However, the hypothesized mechanism which 
determines the position of the cutoff is not 
equivalent to the optimal strategy rule pro- 
posed by Swets et al. Egan in applying 
Swets et al. to recognition memory was not 
concerned with the precise determinants of the 
cutoff nor with the application of the theory 
to the multiple-choice test situation discussed 


later. 


PROB. DENSITY 


FAMILIARITY 


Fic. 1. Basic assumptions: fo(X), the 
curve to the right, represents the distribution 
of events which occurs on those occasions 
when the subject inspects an old item; fx (X), 
the curve to the left, represents the distribu- 
tion of events which occurs when new items 
are inspected. (Assuming that the subject 
sets his cutoff at (Xc) and responds “old item” 
if and only if (X;) exceeds (X), then (Po(A)) 
represents the probability of the subject re- 
sponding old when in fact the item is an old 
item. (Py(A)) represents the probability of 
the subject reporting old when in fact the 
item is a new item.) 


When the subject inspects an old 
item in the test list, it is assumed that 
a covert event occurs which has some 
value (X; on the familiarity con- 
tinuum. | Furthermore, (X; varies 
from item to item according to a 
normal distribution, fo(X). It is 
further assumed that an event on the 
same continuum occurs when the 
subject inspects any of the new items 
on the test list (as a result of general- 
ization from the old items on transfer 
of prior learning). These events also 
form a normal distribution, Tw(X), 
whose variance is equal to that 
of fo(X). The distance on the con- 
tinuum between the means of the two 
distributions (Xo — Xy) may be de- 
termined in Z units relative to the 
variance of the item distributions." 
These assumptions are illustrated in 
Figure 1. e EX ca 

The quantity (Xo — Xy) ijs in- 
‘Ka Xx) corresponds to 


4 Th asure ¢ 
Woe (1961). 


(d^) in Swets et al. 
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fluenced by learning and retention. 
These variables determine the posi- 
tion of the old-item distribution on 
(X). In addition, (Xo — Xx) may 
be affected by generalization (which 
influences the final position of the new- 
item distribution) and by transfer of 
prior learning (which affects the 
initial positions of the new and old 
distributions). If, however, the same 
population of old and new items is 
used in all conditions of an experi- 
ment, then variation in (o - Xy) 
represents variation in learning and 
retention. 

It is further assumed that the 
subject's overt report regarding item 
(2) of the test list corresponds to the 
decision that the event (X; asso- 
ciated with item (7) is a member of 
(foCX)) or, contrariwise, that it is a 
member of (fy(X)). We shall assume 
that this decision is made in terms of a 
cutoff (Xc) set on (X). That is, the 
subject will report that item (i) is an 
old item if, and only if, (X,) > (Xo) 
(see Figure 1). 

For any given position of the cutoff, 
the probability that an old item will 
be reported as old (call this value 
Po(A) or the probability of a “true 
positive") is represented by the area 
under (fo(X)) which lies to the right 
of (Xe). Similarly the probability 
that a new item will be called old (call 
this value Py(A) or the probability 
of a “false positive") in the area of 
(fv (X)) to the right of the cutoff (see 
Figure 1). The proportions of true 
positives (Po(4)) and of false posi- 
tives (Py(A)) actually obtained in an 
experiment are taken to be estimates 

of (Po(A)) and (Py (A)), respectively. 

It follows that if the distance be- 

tween the distributions is fixed (that 
is, if learning, retention, generaliza- 
tion, and transfer are constant), then 
the probability of a false positive 
may be reduced only by moving the 


cutoff to the right. Such a change 
would produce a simultaneous de- 
crease in true positives. 

Egan (1958) presented evidence in 
support of these basic assumptions in 
a recognition-memory test situation. 
His procedure involved noting the 
effects of encouraging or discouraging 
guessing on the probabilities of true 
positives (Po(A)) and of false posi- 
tives (Py(4)). His method for test- 
ing these basic assumptions (i.e., 
plotting “ROC” curves) is equivalent 
to the following procedure. The 
proportions of true positives and that 
of false positives obtained in one of 
the test conditions are taken as 
estimates of the probability of true 
positives, Po(-1), and false positives, 
Py(A), respectively. This pair of 
values is used to estimate the distance 
X Xx) between the old- and new- 
item distributions (see Figure 1). 
That is, a table of normal curve will 
show what the position of the cutoff 
(in units of Z from the mean of the old- 
item distribution) must have been in 
order that the area of the curve above 
the cutoff would be equal to the 
theoretical Po(A). Similarly, the 
value Py(A) is used to find the 
distance between the cutoff and the 
mean of the new-item distribution. 
These two values are added together 
to give the estimate of (X, — Ao. 

When this procedure was repeated 
for each of the test conditions, it was 
found that the resulting values for 
(Xo — Xy) were essentially constant, 
Thus, changes in Po(A) produced by 
encouraging guessing were accom- 
panied by appropriate Changes in 


Py (A). 


The Cutoff Rule: 


The Exbected-Pr. E 
tion Model Nd d 


Beyond 
the present 
Scribe exact] 


these basic assumptions, 
theory attempts to de- 
Y where the cutoff will 
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“be set with respect to the old- and 


new-item distributions. 
. It is assumed that the subject is 
instructed as to the proportion of all 
the test items that are, in fact, old 
items. If the cutoff is set too low, the 
number of items that will be called 
old will probably: be greater than the 
number of old items in the test. 
Similarly, if the cutoff is too high, the 
subject will probably choose fewer 
items than the actual number of old 
items. The unique assumption of the 
present model is that the subject's 
cutoff will be set at such a place on the 
familiarity continuum that he will be 
expected to choose a proportion of items 
equal to the proportion of items which 
are, in fact, old. 

lor any given cutoff, the expected 
proportion of items which a subject 
will choose is given by the straight- 
forward expression: 


EP = (P(0)) (Po(.1)) 
+ GQDGPSCD) [1] 


where EP equals the proportion of 
test-list items the subject will be 
expected to check or, in other words, 
will be most likely to check; P(O) is 
the proportion of all test items which 
are, in fact, old; and P(N) is the 
Proportion of all test items which 
are new. 

_ The basic assumption of this model 
is that the subject's cutoff will be set 


"The word "expected" is {used in the 
mathematical sense. The expected propor- 
be las i of a certain sort (in this case, 

a £m being called old) is the proportion 
which is most likely to occur over a large 
sample of occasions (in this case, a large 
number of test items). The expected propor- 
tion may be found simply by multiplying the 
probability of the outcome (the response) by 
the proportion of occasions (items). á 

Although the subject is not required to 
choose some fixed proportion of the items, we 
assume that he behaves in such a wav as to 
make it most likely that he will choose a 
certain proportion of items. 


such that the expected proportion of 
items checked (EP) will equal some 
proportion (K) of all test items which 
are, in fact, old. That is, it is assumed 
that: 

EP-(K)P(O) 2) 


By substituting Equation 1 into 
Equation 2, we have the definition of 
the expected-proportion cutoff rule: 


(P(0)) (Po(A)) 
+ (PX) C (A) 
= (K)(P)) [3] 


The only unknown of Equation 3, the 
value of (K), is assumed to be in- 
fluenced in an unspecified manner by 
the costs and rewards of correct and 
incorrect responses. In experiments 
in which no deliberate attempt is 
made to vary these values, (K) is 
assumed to be constant. The funda- 
mental assumptions and the expected- 
proportion rule may be tested sepa- 
rately or may be combined and tested 
simultaneously. This latter proce- 
dure will be illustrated shortly. 


EVIDENCE FOR THE EXPECTED- 
Proportion MODEL 


The Davis, Sutherland, and Judd 
Experiment 


Davis, Sutherland, and Judd (1961) 
examined recognition-memory per- 
formance using both consonant-vowel 
nonsense syllables and two-digit num- 
bers as stimuli.6 The data for the 
nonsense-syllable condition will be 
examined first. ] 

Each nonsense syllable consisted 
of 1 of 18 consonants (omitting q 
and z) followed by 1 of 5 vowels. 
A random list of 15 items was pre- 
sented to the subject as the inspec- 
tion list, each item being presented 

© They also tested recall performance with 
these same stimuli, but only the recognition 
data is of interest here. 
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for 1.5 seconds. Immediately follow- 
ing the presentation of the inspection 
list, the subject was given a sheet of 
paper containing the 45 inspection 
(old) items together with 15, 45, or 
75 new items (the test list). Thus, 
the proportion of old items, P(0), in 
the test list was either .500, .250, or 
167. There were 24 subjects in each 
condition. The instructions, which 
were the same in all conditions, re- 
quired the subject to check as many 
old items as he could but not to guess 
wildly. The fundamental dependent 
variables, for our purposes, are the 
proportion of true positives (po(^1)) 
and the proportion of false positives 
(py(A)) obtained in each condition. 
These values appear in Figure 2. 

The test of applicability of the ex- 
pected-proportion model to these 
data is relatively direct. Since the 
conditions of learning and recognition 
are constant (except for the values of 
P(0) and P(N)), the difference be- 
tween the means (Xo — Xy) and the 
value of (K) are presumably constant. 
The values of po(A) and px (4) for any 
one group can be used to estimate 
both of these parameters. 

The data obtained under the P(O) 
= .50 condition were arbitrarily cho- 
sen for this purpose. In this group 
Po(A) was found to be (.694). A 
table of the normal curve shows that 
the probability of a true positive 
would be (.694) if the cutoff were lo- 
cated at (.51) Z units below the mean 

of the new-item distribution. Simi- 
larly, the probability of a false posi- 
tive would be equal to the obtained 
value of (.186) if the cutoff were lo- 
cated at (.89) Z units above the mean 
of the new-item distribution. Add- 
ing these distances (see Figure 1) we 
estimate that the distance between 
the means (Xo— Xy) is equal to 
(1.40) Z units. 

Now in order to estimate (K), these 


same values of Po(-1) and Py(A) are 
placed in Equation 3 along with values 
of P(O) = P(N) = .50. Solving the 
equation, (&) equals .880. 

Since both of these parameters are 
assumed to be constant, we assume 
that the subjects in each of the other 
conditions are faced with two normal 
distributions at a distance of (1.40 Z). 
Furthermore, we assume that their 
cutoff will be set such that they will 
be expected to check a proportion of 
items equal to (.880) times the value 
of (P(O)) for their conditions. Thus, 
for example, when P(O) = .25, they 
will attempt to check (.880), (.25), or 
(.220) of all the test-list items. 

On the basis of these assumptions we 
may predict precisely the values of 
Po(A) and Py(A) which should be 
obtained in each of the other condi- 
tions. This procedure involves a trial 
and error search for the place where 
the cutoff should be set in order that 
Equation 3 will be satisfied." The 


7A cutoff point in terms of the distance 
(Zo) between the cutoff and the mean of the 
old distribution is tentatively selected, and the 
resulting tentative value of Po(A) is found 
from a table of the normal curve. Since the 
distance between the means is assumed to be 
(1.40 Z), there will be a corresponding 
tentative value of Zy and of Py(A). These 
tentative values of Po(A) and Py(A) are 
used in Equation 3 along with the values for 
P(O) and P(N) for the condition. This 
procedure is repeated until a cutoff position 
is found such that the expected proportion 
equals (.880) (P(0)). This procedure may be 


illustrated for the P(0) — .250 condition as 
follows: 


Corre- 
Ten- Ten- spond- Ten- 
tative tative ing tative Expected 
Zo Po(A) Zw  Pw(A) proportion 
—.20Z .579 120Z 115 231 Reject 
—452Z .560 1257  .106 220 Accept 
—10Z .540 1.30Z  .097 


É .208 Reject 
Since -220 equals (.880) (.250), we accept the 
prediction that the cutoff will be located .15 Z 
units below the mean of the old-item distribu- 


tion and that, therefore, Po (A) will equal .560, 
and Py(A) will equal .106. 


Y 
\ 


ett 
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R Probability 


N 
o 


P(0) 
Obtained and predicted recognition 


Fic. 2. 
data when nonsense syllables are used as 
stimuli; K = .880; (Xo — Xx) = 1.402 
(Davis, Sutherland, & Judd, 1961). (In this 
and the following figures, the data in paren- 
thesis were used in making the predictions 
and, therefore, must coincide with the 
predictions.) 


cutoff which yields values of Po(-1) 
and Py CI) that satisfy this equation 
is taken as the predicted place of the 
cutoff, and these values of Po(.1) and 
Py (A) are taken as the predictions of 
bo(A) and py(.1), respectively. The 
resulting predictions are compared to 
the obtained data in Figure 2. 

In another experiment reported in 
the same paper Davis et al. (1961) 
used numbers as stimuli. These items 
consisted of all possible permutations 
of 10 digits taken 2 at a time. In all 
i ad aspects the experiment was 
xs to the one given above, and 
One menty the analysis is the same. 
PO) again, the data obtained when 

) = .50 is used to estimate (X) 
and (Xo — Xy), and these values are 
used to predict the data under P(O) 
= .250 and P(O) = .167. 

I he Davis et al. experiments, while 
providing excellent support for the ex- 
pected-proportion model, leave cer- 
tain points unclarified.* The entire 

£ The goodness of fit of predicted to ob- 
tained data is ascertained by inspection 
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test list was exposed, and the sub- 
ject was allowed to respond to the 
items in any order he wished. Under 
these conditions the subjects can 
produce data which are consistent 
with the expected-proportion model 
without, technically, using a cutoff 
at all (although the other, more 
fundamental assumptions would need 
to be valid). That is, the subject, 
faced with the two normal distribu- 
tions, could start checking off the items 
with highest familiarity and work 
down to items with lower familiarity 
and simply stop when he had checked 
a number of items approximately 
equal to the number of old items on 
the test list. Such a strategy, on the 
other hand, would be impossible if the 
subject were allowed to view only a 
single test item at a time and was 
required to make his decision on that 
item before moving on to the next 
item. A demonstration that the ex- 
pected-proportion model is adequate 


throughout this paper except in Experiments 
I and II described later. The test used in the 
latter ca requires data for individual 

hese data are not available for the 


subjects. ; 
other experiments that are discussed. 


X Predicted 


> 
o 


R Probability 


iS 
o 


P(0) 


nd obtained recog- 
two-digit numbers 
— Xx) 


The predicted a 
a when 


K = 806; (Xo 


Fie. 3. 
nition-memory dat i 
are used as stimuli; 
= 1.23 Z. 
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under such a serial-test procedure 
would provide less ambiguous support 
for the notion of a cutoff. Two 
studies by the present author were 
designed to shed some light on this 
question. The method and results 
of these studies are described in detail 
in Parks (1964). 


Experiment I 


A total of 10 groups of 8 subjects each 
was presented with a 16-item list of con- 
sonant-vowel-consonant trigrams of low asso- 
ciation value (Archer, 1960). Each subject 
viewed each item of the list only once, the 
exposure of each item being 1.67 seconds with 
a 1.33-second break between items. All 
subjects received the same inspection list, but 
eight different orders of presentation were 
used within each group. Immediately follow- 
ing the presentation of the last inspection 
item, the subject was presented with a test 
list composed of a random mixture of the 16 
inspection items and 4, 8, 16, 32, or 64 new 
consonant-vowel-consonant trigrams. Thus, 
the proportion of test items that were old 
items was either P(O) = .80, .67, .50, .33, or 
.20. In each case the subject was carefully 
instructed as to the number of old and number 
of new items which would appear in his test 
list. Care was taken to balance the effects 
of the particular new items used across the 
levels of P(O). 

In addition to differences in P(O), the tests 
differed in response procedures. Half of the 


X Predicted 
O Obtained 


R Probability 


P(O) 


Fic. 4. Obtained and predicted data for the 
free-responding condition of Experiment I. 
(Predictions are made from the expected- 
proportion model and the P(0) = .80 data; 
K = 971; (Xo — Xx) = 1.39 Z.) 


X Predicted 


.60 


R Probability 
è 


iv 
[e] 


o 
o 


P (0) 


Fic. 5. Obtained and predicted data for 
the serial-responding condition of Experiment 
I. (Predictions are made for the expected- 
proportion model and the P(O) — .80 data; 
K = .906; (Xo — Xx) = 1.56 Z.) 


groups were allowed to uncover the entire list 
at the beginning of the 
any order they desi 
condition). 


test and to respond in 
d (free-responding" 
The other five groups had a 
ies of sliding masks over their test lists 
such that they could view only a single item 
at a time and were forbidden to return to an 
item already examined (“ 
condition). 

In both cases the subjects w 
to underline any item which they thought had 
been on the original list. They were permitted 
to guess but encouraged not to guess wildly 
The actual lists used in the two responding 
conditions were identical except for the masks 
placed over the lists in the serial condition 
Finally, in both conditions, the subjects Were 
free to take as much time as they desired but 
were encouraged to “keep moving along." 


The treatment of the data is 
identical to that employed above with 
the data of Davis et al. The Po(.1) 
and Py(A) scores obtained in P(O) 
= .80 under the free-responding con- 
dition are used to estimate (Xo — Xy) 
and (K) and, from these values, to 
predict the remaining free-responding 
data. The mean data and predictions 
appear in Figure 4. Similarly, the 
serial-responding P(O) = .80 data 
are used to predict the performance 
of serial-responding groups (see Fig- 
ure 5). 
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" Fic. 6. Obtained and predicted data of 
[experiment Il in which the P(O) of the first 
few items was equal to the overall P(O) for 
the condition. (Predictions are made from 
the expected-proportion model and the 
4 v m data; K —.809; (Xo — Xx) 
= 1.52. 


The goodness of fit of each of the 
four predicted curves was evaluated 
using a test suggested by Grant 
(1962). Essentially this procedure 
tests the correlation of predicted and 
obtained means against the variance 
within conditions. The null hy- 
pothesis is that the correlation is no 
greater than that which would be ex- 
pected on the basis of chance. The 
results are as follows: free responding, 
Po(A): F = 24.068; free responding, 
Py(A): F = 17.716; serial respond- 
ing, Po(A): F = 19.826; and serial 
responding, Py(A): F = 5.030. In 
each case df = 1/35. All of these are 
significant at the 05 level of confi- 
dence. In fact only the serial-respond- 
ing Py(A) measure fails to achieve 
significance at .01. 

Inspection reveals that under the 
serial-responding condition the ob- 
tained value of Px(4) is lower than 
predicted at P(0) — .50 and higher 
than predicted at P(O) = .20. Re 


examination of the test lists employed 
in these conditions suggested that this 
effect. might have been due to the 
proportion of old items among the 
first few items of the test list. It was 
found that the mean proportion of old 
items in the first 10 items for the 
P(O) = .50 lists was .60. The mean 
proportion among the initial 10 items 
for the P(0) =.20 lists was .10. 
These differences between overall 
P(O) and P(O) within the initial test 
items might well have made the sub- 
jects more conservative in the former 
case and more liberal in the latter. 
This variable would not systemat- 
ically affect the free-responding sub- 
jects since they viewed the items in 
any order desired. A second experi- 
ment supports this hypothesis. 


Experiment II 

Two groups of eight subjects each were 
tested under identical conditions to those of 
the serial-responding P(O) = .50 and P(0) 
= .20 groups of Experiment I. In this case, 
however, the test lists employed were ran- 
domly constructed but with the restriction 
that the P(0) = .50 lists each have 5 old 
items among the first 10 test items and lists 
for P(0) = .20 each have 2 old items among 
the first 10 items. 

The results of this study appear in Figure 6 
along with the theoretical curve generated 
from the P(O) = .50 data. In this case the F 
value for correspondence of obtained and 
predicted means is equal to 12.598 for Po(A) 
and 5.000 for Py(A) (df = 1/14, p < 01, and 
p <.05), respectively. 


In these experiments, as in those of 
Davis et al, when P(O) is varied, 
the subjects' performances varied as 
though they were using the expected- 
proportion cutoff rule. Furthermore, 
the fit of the model to the serial- 
responding data indicates that the 
cutoff hypothesis may indeed be valid 
and lends support, in the interests of 
parsimony, to the cutoff type of 
description of free-responding per- 


formance. 


a 
VN 


Finally, it is interesting to note that 
the theoretical curves which fit these 
data are based, in part, on the as- 
sumption that (Xo — Xx) is constant. 
According to the theory, this fact may 
be interpreted as indication that re- 
tention was constant across the vari- 
ous levels of P(O) even though the 
subjects in the lower levels of P(O) 
took more time to complete the test. 
In the P(O) = .80 condition, the 
subjects required an average of 112 
seconds and 121 seconds to complete 
the test in the free-responding and 
serial-responding groups, respectively. 
When P(0) = .20, a mean of 394 and 

273 seconds was required. A 2X 5 

analysis of variance on the time 

measures revealed a significant effect 

of P(O) level (F = 12.901, df = 4/70, 

p < 01). 

From a methodological standpoint, 
this finding illustrates the value and 
even necessity of precise theory since 
it could not have been ascertained by 
simple examination of the data. Per- 
haps, in fact, the main value of the 
present theory lies in its being able to 
provide a means of evaluating reten- 
tion which allows for, rather than 
being confounded with, the particular 
conditions under which retention is 
measured. 

Before proceeding, one possible 
source of confusion regarding these 
tests of the expected-proportion model 
must be discussed. lf the subjects 

tend to choose a certain fixed number 
of items, the resulting pair of po(A) 
and py(.l) values will necessarily 
satisfy Equation 3. But there are a 
variety’ of such pairs that may occur. 
The accuracy of the predictions given 
above demonstrates that the particular 
pair that occurs in cach condition is 
that unique pair of areas that both (a) 
will satisfy Equation 3 and (b) may 
occur when a cutoff is moved along 
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two normal curves of equal variance 
at a fixed distance. 


Other Applications of the Expected- 
Proportion Model 


The expected-proportion model may 
also be tested by holding P(O) and 
P(N) constant while some other 
variable of learning or retention is 
manipulated. In this case, however, 
as we shall see, only the ideas cx- 
pressed in Equation 3 are tested (i.e., 
no test is made of the fundamental 
assumption of normality). 

One such experiment has been re- 
ported by Strong (1912). The num- 
ber of stimuli in the presentation list 
was the independent variable, Either 
5, 10, 25, 50, 100, or 150 magazine 
advertisements were presented one at 
a time at a rate of one per second. 
Immediately following the presenta- 
tion of the inspection list, the subject 
was given a stack of stimulus cards 
containing à random mixture of the 
inspection items (ie., "old" items) 
together with an equal number of new 
advertisements, Thus, the test list 
contained either 10, 20, 50, 100, 200, 
or 300 items. The subject was re- 
quired to sort the test items into four 
stacks according to his degree of 
conviction that each item was an old 
item. For our purposes, however, we 
will lump together the three classes 
that included items which the subject 
"had at least a faint idea he had seen 
before . . . ie, those of which he 
was about 25% sure [p. 449]." The 
proportion of the old items in the 
test list which the subject placed into 
any one of these three classes shall be 
considered to be the proportion of 
true positives (po(4)). The propor- 
tion of all new items in the test list 
which the subject placed into any of 
these three classes is taken as the 
proportion of false positives (Py (A)). 

The values of the parameters ob- 
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TABLE 1 


2 ^ q ut 
PROPORTION Or ‘TRUE POSITIVES AND Fat 


: Posirives OBTA 


AS A FUNCTION Or T Fis 
UNCTION OF THE NUMBER OF ADVERTISEMENTS IN THE INSPECTION List 


Ree 300 40 150 


Condition Sos | No old 

i list items 
Ree 10 80 ! En s i 
Ree 20 40 10 
2 S 
Ls 50 | 40 75 
Rec 100 40 50 
Ree 200 | 40 100 

| 


No. old | Obtained | Obtained 
Wstkems | Pot) Px) 
5 8901. | — 036 
wo | /— 885 .063 
25 823 ‘074 
50 ‘726 n 
100 ‘678 | 135 
180 — | 330 129 


Note.—Test list cons A eee EY 
Test list consisted of the inspection-list items together with an equal number of new items. 


tained in each condition are given in 
lable 1 along with the number of 
subjects for each condition. 
Ordinarily, the expected-proportion 
model might be tested against these 
data by simply placing the estimates 
of P(O) and P(N) obtained in each 
group along with values P(O) = PCV) 
= .50 into Equation 3 and solving for 
(K). The value (A) should, of course, 
be constant. However Equation 3 is 
based on the assumption that the 
subject has been informed of the 
values, P(O) and P(N). According to 
his report, Strong's subjects were ap- 
parently not so instructed. Thus, the 
model must be modified as follows: 


(K)(P(0)! = (P(0)} (Po) 
+ (P(N) (Px) E 


where these prime marks indicate 
values assumed by the subject rather 
than the actual values of the param- 
eters, That is, the prime values are 
fitting parameters. f 

By one interpretation this equation 
simply states that the subject will set 
his cutoff such that the expected 
Proportion of items he believes he will 
check (based on his estimates of P(0) 
and P(N)) will be equal to some 
constant fraction of what he believes 
to be the proportion of test-list items 
Which are old items. 


In order to test this model against 
Strong's data, Equation 4 must be 
rewritten as follows: 


P(N)! 


P(0) (Py CL) [5] 


(K) = (Po CD) + 


Now if it is assumed that P(O)’ and 
P(N)‘ are constant across groups, then 
P(N)'/P(O) will be constant and 
may be treated as a single fitting 
parameter. As usual, (K) is also 
assumed to be constant. Since Po (A) 
and Py (4) are known for each group, 
an equation in two unknowns may be 
written from the data of each group. 
Solving two such equations (from the 
10 and 20 conditions) simultaneously, 
it is found that P(N)'/P(O)' = (2.22) 
and (K) = (.971). 

It is now possible to predict the 
e of Pxw(1) for any group by 
these values together with the 
of Po(.1) for that group in 
Equation 3. These predictions are 
compared to the obtained values in 
Figure 7. Except for data obtained 
with the 300-item list, the fit appears 
to be satisfactory. 

'The expected-proportion model may 
also be applied to data gathered using 
what has been called a multiple-choice 
or forced-choice testing procedure. In 
this case, the test list is divided into 
subsets consisting of one old item and 
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Fic. 7. Obtained and predicted values of 
Px (A) using the modified expected-proportion 
model and obtained Po(A) in each condition. 
(This experiment differed from Davis et al. 
in that (a) P(O) and P(N) were not varied; 
and (b) the subject did not know the value of 
P(0)—Strong, 1912.) 


(n) new items. The instructions em- 
ployed along with this procedure 
usually require the subject to choose 
no more nor less than one item out of 
each subset. This requirement ordi- 
narily would place the experiment 
beyond the scope of the present 
theory. That is, if the subject is re- 
quired to examine a subset and to 
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pick one item before proceeding to the 
next set, he cannot use a cutoff at all 
since it may be that no item exceeds 
his cutoff, and yet he must respond.’ 
On the other hand, the cutoff rule may 
be employed in a multiple-choice 
situation if the subject is free to move 
back and forth among the sublists. 
In this case, if no item exceeds the 
cutoff in one subset, the subject may 
go on to another subset and return 
later. 

In order to apply the expected-pro- 
portion model to this sort of data, all 
of the assumptions of the expected- 
proportion model are made. That is, 
it is assumed that the subject is faced 
with two normal distributions at fixed 
distance on a familiarity continuum 
and that the cutoff is set according 
to the expected-proportion model. 
Furthermore, the additional assump- 
tions are made that (a) in each subset 
the subject will choose the first item 
that he examines which exceeds the 
cutoff, and (b) that (K) = 1.0 Ac- 
cording to Assumption a the proba- 
bility of a true positive (estimated by 
Po(A)) will be given by: g 


do(Á) es expected number of old items that exceed cutoff 


expected total number of items that exceed cutoff 


The expected number of old items 
that exceed the cutoff is equal simply 
to the probability that the single old 
item in each subset exceeds the cutoff 
—that is, it is equal to Po(A). The 
expected number of new items that 
exceed the cutoff is given by the 
probability that any particular new 
item exceeds the cutoff—that is, by 
Py (A)—multiplied by the number of 
new items (z) in the subset. There- 
fore, Equation 6 may be rewritten as: 


Po(A) 
(1) (Po(A)) [7] 


7 () (Po(4)) + () P (4) 


[6] 


„° In this case the subject would probably 
simply pick the item with the greatest 
familiarity response. In support of this 
hypothesis, it can be shown that the raw (un- 
corrected) data reported by Flores (1958) may 
be fit by the equation Po(A) = P(Xo» Xy)” 
where (n) is the number of new items in each 
subset, and P(X55 Xy) is the probability that 
one old item will exceed any one new item. 
This latter quantity is treated as a fitting 
parameter and is assumed to be constant if the 
conditions of learning, retention, and test 
(except 2!) are constant, ' 

For our purposes this assumption is 
equivalent to the assumption that the subject 
rejects some members of each subset (those 
that do not exceed the cutoff) and makes a 
random guess among the remaining items, 


Murdock (1963) has suggested and defended 
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Now if (K) = 1, Equation 3 given 


earlier may be rewritten as: 


— Po(A) 
n 


Py(4) = + 


a bstititing this expression into 
“quation 7 and simplifying: 


Po(A) = Po(-l) [8] 


In other words, it follows from the 
assumptions, that the obtained pro- 
Portion of true positives will be, as 
usual, an estimate of the probability 
that an old item will exceed the cutoff. 
Similarly, it can be shown that py (A) 
estimates Py(A)." Thus, the model 
may be tested in exactly the same 
Manner in which it was tested against 


à hypothesis which is very similar to this 
latter assumption. i 

_ "The score py(4A) is the proportion of new 
items that are called old. Therefore, it is 
equal to the proportion of subsets on which a 
new item is chosen divided by the number of 
new items in each subset. 
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the data of Davis et al. In this case, 
of course, (K) is not a fitting param- 
eter but is assumed to equal one. 

It should be noted that in testing 
the theory against multiple-choice 
data, only the fundamental assump- 
tions of normality and a movable 
cutoff are being tested. That is, it is 
known a priori that the data will 
satisfy Equation 3 since the subject 
is forced by the instructions to choose 
a constant number of items (i.e., one 
per subset). On the other hand, al- 
though a variety of pairs of Po(A) and 
py (el) values will satisfy this require- 
ment, there is a unique pair which will 
also satisfy the requirement of rep- 
resenting corresponding areas under 
two normal distributions at a fixed 


distance. 


The Postman Experiment 


An experiment reported by Post- 
man (1950) used this paradigm. He 
presented a list of 36 nonsense syl- 
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Fic. 8. Obtained and predicted dete 1.46 (Postman, 1950). 
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X Predicted 
O Obtained 


P (0) 


Obtained and predicted data using a modified expected-proportion model; 


(Xo — Xx) = 2.18 (Murdock, 1963). 


lables, the list being presented five 
times in five different orders. Follow- 
ing these learning presentations, a 
written multiple-choice recognition- 
memory test was administered. This 
test consisted of a series of 36 subsets, 
each subset containing one old item 
and either one, three, six, or nine new 
items. Thus, there are four recall 
conditions; P(O) — .50, .25, .14, and 
.10, respectively. These 36 multiple- 
choice subsets were presented on a 
single page, and thesubject was asked 
to indicate which member of cach list 
vas an old item. 
The data obtained under the P(Q) 
= .50 condition are used to estimate 
(ïo — x). This value along with 
(A = 1) and level of P(Q) are used to 
predict the rest of the data in the 
manner described for the Davis et al. 
experiment. The obtained and pre- 
dicted values of poCl) are given in 
Figure 8. The fit appears to be 


satisfactory. 


The Murdock Experiment 


Murdock (1963, p. 18) reports a 
very similar experiment. Using gen- 
eral knowledge statements as stimuli, 
a list of 200 multiple-choice subsets 
was administered to 97 subjects. Of 
these 200 subsets, 67 cont 
correct statement (“old” item) and 
one incorrect statement (“new item), 
67 contained one old and two new 
items, and 66 contained one old and 
three new items. Thus, P(0) = -50, 
33, or .25. The instructions required 
the subject to select one item of each 
subset. The obtained values of po(A) 
and bx(A) after Murdock's correc- 
tion for guessing is removed are given 
in Figure 9 along with the predicted 
values. The Predictions are made 
from the P(O) = 50 data (where (K) 
is assumed to equal D. The fit ap- 
pears to be satisfactory. 

The studies reviewed in this section 
are especially important in distin- 


ained one 
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.guishing the movable-cutoff model 


underlying the present theory from the 
sophisticated fixed-threshold model 
proposed by Luce (1963) for psycho- 
physical detection. Luce's model, 
when suitably translated for applica- 
ue to the recognition-memory situa- 
wil ie be shown to be consistent 
(19 1 the data obtained by Davis et al. 
( i Parka (1964), and Strong 
üld m ^ j Towever, this fixed-thresh- 
ak el is apparently incapable of 
xplaining the precise effect of P(O) 
On performance under the multiple- 
choice procedure of Postman (1950) 
and Murdock (1963). Although the 
data of these latter experiments are 


" Rather than assuming that (D) and (20 
are determined by an asymptotic learning 
Process as described by Luce, it is assumed 
that these biasing factors result from the 
Subject’s attempt to match the expected 
Proportion of old responses to P(O). Thus, 
for example, (0) is set such that: 


(D (q(0)) (P(0)) + W) (aC ND) (P(X) 
: = K (P(0)). 
Phat is: 

oy 4 Pa 
ql P(0) 
Where K is determined, in 
manner, by the cost and values involved. 
follows that: 


ts 


an unspecified 
It 


P(N) 


(A) = K 
PO) (py 


Po(A) + 


ure, the subject 


lU ae cies Seen, dire 
Is dn Strong's procec P (N', then: 


employs estimates P(O)’ anc 
i P(N)! 
bol) + POY 
1 hese predictions are, 
identical to those made fr: 
Portion, movable-cutoff theory. 
The two theories differ in t 
Phisticated fixed-threshold theory 
that, if the conditions of learning and 
tion are constant, the obtained values Poe 
and py (A) will lie on an ROC curve compe 
Of two intersecting straight-line poised 
n practice such a function has. been in^ 
distinguishable from the Gaussian 
function of the movable-cutoff theory. 


(px CD) = K 


as far as they £9 
om the expected-pro- 


hat the so- 
predicts 
] reten- 


consistent with the very broad predic- 
tions yielded by Luce's "A alterna- 
tive" model, there is no ready explana- 
tion of the fact that the obtained data 
lie in each case on a single ROC 


function. 


CONCLUDING COMMENTS 


Of the independent variables which 
occur in the studies reviewed, only 
those concerned with test procedure 
are specifically handled by the theory. 
The other variables which have been 
mentioned (length of list, type of 
stimuli, and procedure) are indifferent 
with respect to the theory. That is, 
they may or may not be effective 
variables of (Xo — Xy) and (K). In 
this sense, the present theory is con- 
cerned with recognition-memory per- 
formance rather than learning and 
retention. per se. Nevertheless, in 
describing the end product of learning 
and retention, the theory has implica- 
tions for any description of that 
learning upon which recognition re- 
sponses are based. On a broader 
scale, the great similarity of the 
present theoretical assumptions to 
those which have been found to be 
adequate in psychophysical situations 
suggests that the essential features of 
this theory might, in time, be shown 
onlv a special application 


to be iti 
broad theory of decision 


of a very 

behavior. 
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HEDONIC ORGANIZATION 
ANIZ: N AND REGUL 
OF BEHAVIOR? nen 


Paut THomas Youxc? 


University of Illinois 


a ee eaae review supplementing previous articles upon 
c processes in motivation and learni Speci: 
emphasis is placed upon the oreanizi sy ee 

a ganizing and regulatory functions of 
espe in quin p iege ii IRURE from the author's laboratory 
E gj heoretical significance considered. Tt - 

i x : g d 5 hese ex- 
gm deal with hedonic processes as related to gustatory stimula- 
ion, food preference and intake, regulation of ingestive behavior, 


development of incentive values, and related matters. 


The main 


topics considered. are: basic concepts and definitions; psychophysical 
studies of hedonic processes; studies of thresholds, preference, and 
the regulation of food intake; physiology of affective processes; the 
role of affective processes in motivation and learning; general con- 


clusions. 


In a previous paper (Young, 1959) 
I reviewed experiments based upon the 
hypothesis that affective processes have 
an objective existence within the nerv- 
ous system. The present paper re- 
ports further research within the 
framework of experimental hedonism. 
It brings up to date the account of find- 
ings and hypotheses, Earlier theoreti- 
cal papers and bulletins are available 
(Young, 1941, 1948, 19492, 1952, 1955, 
1957), 


Bastc CONCEPTS AND DEFINITIONS ? 


Operational Definitions 


Food acceptance is a complex proc- 
ess regulated by at least four groups 
of determinants: (a) Organic condi- 


l'This paper is an outgrowth of research 
supported by National Science Foundation 
Grants G-14538 and GB-215. I am indebted 
to W. E. Kappauf for reading this paper and 
making many helpful suggestions. 

2 Retirement address: 336 Notre Dame 
Road, Claremont, California 91712. 

3 Throughout this paper I will use, for 
simplicity and convenience, the symbols and 
specifications employed in recent experi- 
ments as follows: CHO — sucrose; NaCl 
= sodium chloride; Q = quinine hydrochlo- 
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tions, including metabolic needs pro- 
duced by specific deprivations, and spe- 
cial organic states such as pregnancy, 
lactation, experimental states induced 
by surgical operations, implanted elec- 
trodes, drugs, etc.; (b) peripheral 
stimulations of the head receptors 
(taste, smell, touch, vision, audition) 
including environmental stimulations 
from the surroundings of a food. Per- 
haps under this heading one can include 
ecological factors such as availability 
of a foodstuff in the environment, phys- 
ical condition of food, and human so- 
cial determinants—economic conditions, 
religious taboos, esthetic factors, verbal 
suggestions, etc.; (c) previous experi- 
ence, including the dietary history of an 


ride; H=tartaric acid; Sac = sodium sac- 
charin; H:O = distilled or deionized water. 
Unless otherwise indicated, concentrations 
will be expressed as grams of solute dis- 
solved in 100 cubic centimeters of solution. 
Thus 4 CHO designates a simple sucrose 
solution containing 4 grams of sucrose per 
100 cubic centimeters of solution; .01 Q+ 
1.59 CHO designates a compound solution 
containing .01 gram of quinine hydrochloride 
plus 1.59 grams of sucrose per 100 cubic 
centimeters of solution (see Pfaffmann, 
Young, Dethier, Richter, & Stellar, 1954). 
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organism, the feeding habits formed, 
attitudes, motives, expectancies, and 
similar dispositions; (d) bodily con- 
stitution, or the biochemical make-up of 
organisms which differs from individ- 
ual to individual and from species to 
species. It is known, for example, that 
cats do not taste and discriminate sugar 
solutions and that birds are weak in 
the so-called “sweet tooth.” 

Referring to these groups of deter- 
minants we define drive, appetite, spe- 
cific hunger, general hunger, etc., as 
organic motivations regulating behavior 
when the other groups of determinants 
are held constant. We define palata- 
bility as the hedonic characteristic of 
a food dependent upon stimulation of 
the head receptors by a foodstuff and/or 
its surroundings when other determi- 
nants are held constant. We define 
food habit as food acceptance deter- 
mined by previous experience rather 
than by the other conditions. We speak 
of constitutional determinants of food 
acceptance and study individual and 
species differences when other factors 
are controlled. Basic concepts can 
thus be defined operationally. 

The terms acceptance and acceptabil- 
ity are neutral and carry no subjective 
stigma, These terms refer to observed 
ingestive behavior and are free from 
bias as to conditions that determine 
food intake, 


The Concepts of Choice and Preference 


A distinction should be drawn be- 
tween choice and preference. Choice 
implies that there are two or more 
possible courses of action for which 
an organism is primed, Preference is 
a form of choice which carries the ad- 
ditional implication that one course of 
action is better than another, Prefer- 
ence is an evaluative concept implying 
that Response A is more acceptable, 
more desirable, more esteemed, more 
valued, better liked, more pleasant than 
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Response B. A choice may be made 
without evaluation since many factors 
determine choice. 

A serious question can be raised as 
to the extent to which affective proc- 
esses are actually involved in choice. 
According to Lloyd Morgan’s canon, 
we should not postulate affective de- 
terminants of choice if choice can be 
explained completely in terms of sen- 
sory stimulations, reflexes, and habits. 

The necessity of distinguishing be- 
tween habit and palatability as separate 
determinants of choice came to light in 
an early experiment by Young and 
Chaplin (1945). They observed two 
incompatible choices in quick succession 
with the same rats. On the standard 


preference tester the rats showed a per- , 


sisting preference of sugar to casein. 
This was the preference that originally 
developed under need-free conditions. 
The preference persisted, however, dur- 
ing a period of steadily increasing pro- 
tein starvation. When the protein- 
starved rats were tested on a different 
kind of apparatus, they promptly de- 
veloped a new and incompatible pref- 
erence (casein > sugar). Now the 
Concept of two simultaneous but in- 
compatible preferences does not make 
sense. So control experiments were 
made (Young, 1945). In the light of 
the control experiments T interpreted 
the persisting sugar preference in the 
original experiment as a preferential 
habit that did not correctly reveal the 
changed palatability relations, and I 
concluded that preferences tend to de- 
velop in agreement with metabolic need 
states, but preferential habits pers 
independent determinants of choice 
even though the choice is unwise— 
contrary to metabolic needs, 

It is clear from such evidence that 
choice has various determinants. One 
of these is the hedonic outcome of a 
response. T have postulated on sound 
grounds that hedonic processes regu- 


pu 
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late contemporary behavior according 
to the principle of maximizing the posi- 
tive and minimizing the negative, and 
tar hedonic processes organize neural 
dispositions which persis E reguly 
behavior as dears PA e Lore 
ot choice. The hedonic outcome of a 
choice regulates the development of 
preferential discriminations. For ex- 
ample: A naive rat approaches the 
choice point on a T maze: he can turn 
right or left. On Trial 1 he may turn 
cither way depending on a variety of 
Predisposing factors, but after he has 
experienced the taste of sugar in the left 
Seal box and the taste of casein (or 
perhaps nothing) in the right goal box. 
the Ptobability that he will turn left on 
Successive runs increases approaching 
the asymptote of 100% left turns. We 
conclude that the vit prefers sugar to 
casein. The preference is revealed by 
the increasing probability of a left turn 
Which we have called the preferential 
trend, 

Therefore, it is clear that preference 
has at least two meanings. First, pref- 
erence is a functional variable—an im- 
mediate evaluation based in part upon 
affective processes and revealed in a 
single trial. Second, preference is a dis- 
positional variable—an acquired habit 
of discrimination that develops with 
Tepeated choices, These two meanings 


are closely interrelated: Affective proc- 
S ositions 


esses organize persisting disp 
Which in turn regulate choice. A spe- 
cific choice depends upon existing 
habits as well as concomitant hedonic 
Processes and possibly other factors. 


Experimental Methods with and with- 
out Choice 

The single-stimulus method was pro 
Posed by Beebe-Center, Black, Hoff- 
man, and Wade (1948) as a technique 
for studying “preference.” The method 
has many useful applications in studies 
of food acceptance, but it is not a true 


preference method because it excludes 
the possibility of choice. The single- 
stimulus method and the running meth- 
ods clearly reveal differences in per- 
formance. 

The single-stimulus method was used 
by Weiner and Stellar (1951) in study- 
ing acceptance rates of NaCl solutions. 
They found that naïve rats gave the 
normal "preference-aversion" (accept- 
ance-rejection) curves with a maximal 
rate of acceptance at .8 NaCl and an 
"nversion point" (indifferent concen- 
tration) at about 1.5 NaCl. The sub- 
jects did not regulate intake to obtain 
a constant quantity of NaCl, as re- 
quired by the homeostatic doctrine, but 
the quantity ingested varied with NaCl 
concentration. These findings with the 
single-stimulus method agree with re- 
sults obtained by methods that involve 
choice. 

Young and Greene (1953) found 
that with a single-stimulus method rats 
ingested more 9 CHO than 36 CHO 
during a 1-hour drinking test, but when 
offered a choice they ingested more 36 
CHO. Also on the brief-exposure 
preference tester the subjects revealed 
a clear and consistent preference of 36 
CHO to 9 CHO. Hence preferences 
cannot always be predicted correctly on 
the basis of results obtained with the 
single-stimulus method, and a distinc- 
tion must be drawn between preference 
and intake. i 

Carpenter (1958) offered rats a 
choice of CHO and NaCl solutions 
a two-bottle and an eight-bottle 
technique. He found that the pattern 
of intake differed with the method of 
tion. When the rats had choice 
ht CHO solutions, the data 
showed that absolute and relative in- 
take varied directly with concentration. 
Carpenter described this as a apr 
preference” curve, but it is nothing 
more nor less than the ordering of 
CHO solutions according to relative 


with 


presenta 
among elg 


62 PauL THomMas YOUNG 


palatability as revealed by other fied 
ods of testing preference. With NaCl 
solutions the eight-bottle technique 
shifted relative intake toward the lower 
rations. 
prc and Chaplin (1949) used an 
eight-bottle technique in presenting 
NaCl solutions within a self-selected 
diet that contained many other aqueous 
solutions. They found maximal intake 
at .7 NaCl (actually .75 NaCl) which 
is very near to the known optimal NaCl 
concentration. With an eight-bottle 
technique the pattern of intake agreed 
with the well-authenticated acceptance- 
rejection function for NaCl solutions, 
and, incidentally, the pattern of relative 
acceptability was the same for normal 
and adrenalectomized rats. The total 
diet was a factor in their result. 
Hence it is clear that a result de- 
pends upon whether or not the possi- 
bility of choice exists and upon the 


number of bottles of fluids presented 
for choice, 


Main Characteristics of Our Research 


Methods 


Our recent methods of research have 

the following characteristics: (a) The 

stimuli have been simple and compound 

taste solutions. Solutions have a big 

advantage over natural foods in that 

precise quantitative control of concen- 

tration (hedonic intensity) is possible, 

and, further, the interactions of solutes 

with positive and negative valences can 

be studied. (b) The work has been 

done mainly with need-free (nonde- 
prived) rats. It has been found that 
need-free subjects are more ready to 
discriminate among test foods than 
partly starved, undernourished animals. 
Further, need-free subjects yield quan- 
titative norms of ingestion and prefer- 
ence that can be used in various studies 
of dietary depletions, surgical opera- 
tions, special organic states such as 
pregnancy, species comparisons, etc. 


(c) Our methods have offered rats a 
free choice between test fluids that are 
briefly exposed. Brief-exposure tech- 
niques give a cross section of the pref- 
erential hierarchy at the time of the 
test and make it possible to distinguish 
between such phenomena as acquisition 
of appetitive and aversive behavior, on 
the one hand, and approach to organic 
satiation, on the other. (d) The up- 
and-down psychophysical method yields 
an objective and quantitative measure 
of palatability if one uses the sucrose 
scale as the scale of measure. This 
method is described below. 


PSYCHOPHYSICAL STUDIES or 
HEDONIC PROCESSES 


The Up-And-Down 
Method 


Psychophysical 


Prior to introduction of the up-and- 
down psychophysical method we were 
limited to preferential hierarchies or 
transitive series, such as the following 
(Young, 1949b, p. 40) : 50 CHO > 4 
CHO > .5 NaCl > H,O > 3 NaCl. 
Such series do not reveal the difference 
in palatability between any two mem- 
bers of the series. The up-and-down 
method, however, yields an equivalent 
CHO concentration for each member 
in the series and thus provides infor- 
mation about the relative spacing of the 
members along the hedonic continuum. 

The up-and-down method was intro- 
duced to our laboratory by W, E, Kap- 
pauf, beginning with the doctoral re- 
Search of Christensen (1962) and 
extending to current studies, The 
method has been employed in experi- 
ments by Kappauf, Burright, and De- 
Marco (1963), Young and Schulte 
(1963), and Young and Madsen 
(1963). These papers should be con- 
sulted for technical details. In general, 
we have followed the computational 
procedures outlined by Dixon and 
Massey (1957). 


| 
" 


| 
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he up-and-down method rests upon 


, the principle that when rats are offered 


ie ee AW sucrose solutions 
centration T Higher (sweeter) sons 
anly ty a j i principle applies not 
compounds kh. -— solutions but to 
one ae that contain sucrose plus 
Aitinise aoe such as salt or 
E "n! 3 palatability of a simple 
dn solution can be raised by 
ihe mc. and lowered by decreasing, 
be doigts concentration. It should 
lee d €d out, however, that palatabil- 
as defined by preference tests is not 
= Same as intake or rate of licking. 
Maximal intake and maximal activa- 
tion of the tongue occur at approxi- 
mately 8.5 CHO, , 

The up-and-down method was first 
applied to testing groups of rats. Or- 
dinarily a group of 30 subjects can be 
tested during a single session. From 
the data one can compute for the group 
75 a group the mean isohedonic con- 
centration, the SD of the sample, and 
the SE of the mean, as described by 

ixon and Massey (1957). Thegroup 
Method yields a measure of hedonic 
Equivalence in a single test. Since the 
Choice of one subject determines the 
Preference test to be given to the next 
Subject, the method reduces to a mini- 
mum the effects of experience. 

An obvious limitation of the group 
up-and-down method is that it does not 
give measures of hedonic equivalence 
for individual subjects and hence yields 
no information about individual differ- 
ences, Young and Madsen (1963), 
however, demonstrated that the up- 
and-down method can be applied to 
testing individuals. Their technique, 
believe, is the best available method for 
testing taste preferences of indivi uam 
Subjects, and I would strongly recom- 
Mend it to anyone E^ ocn] a 
Search in this area. Young and Madz 
Sen had no difficulty wor 
Choices per rat in a single session. 


With individual tests the choice during 
one trial determines the pair of test 
fluids to be presented in the next. 
Their criterion of preference was con- 
tinuous drinking of one fluid for 5-10 
seconds immediately aíter forced sam- 
pling of first one and then the other 
fluid, followed by a period of free 
choice. 


Palatability Ratings with the Sucrose 
Scale 

Since need-free rats, when offered a 
choice, prefer the higher (sweeter) 
concentration of two sucrose solutions, 
it is possible to use sucrose concentra- 
tion as a basis for scaling levels of 
acceptability of fluids and solid foods. 

Hedonic scaling is illustrated in Fig- 
ure 1 which shows the hedonic equiva- 
lence of solutions of sodium saccharin 
and sucrose as determined by Young 
and Madsen (1963). In collecting the 
data presented in this curve we selected 
the seven standard concentrations of 
sodium saccharin shown along the base 
line of the figure. For each standard 
eight rats were tested individually with 
the up-and-down method to determine 
the mean sucrose equivalent concentra- 
tion. For each test the formula was 
K Sac versus X CHO. The results of 
a test with X varied up and down 


established : 
K Sac = X CHO 


in which K is constant throughout a 
test (eg., -40 Sac), and X is the mean 
CHO concentration at hedonic equal- 
ity. The symbol = is here introduced 
to represent hedonic equality. ae 
Vertical lines in Figure 1 indicate 
the range of means for the eight sub- 
jects. The solid curve shows the mean 
hedonic-equivalence function for the 
group as a whole. On the basis of this 
curve we estimated the optimal con- 
centration at .5 Sac and that this opti- 
mum is hedonically equivalent to a 
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CONCENTRATION SACCHARIN IN GM/IOO ML SOLUTION 


The hedonic-equivalence function for simple solutions of sodium saccharin and 


sucrose. (From Young & Madsen, 1963.) 


Sucrose concentration of 3,5 CHO. 
Incidentally, human observe 
that saccharin solutions are sweet and 
pleasant up to about .5 Sac and that 
higher Concentrations become sharp 
(bitter?) and unpleasant à; 


TS report 


With suitable modifications of the 
apparatus it would be possible to ob- 
tain objective palatability ratings of 
solid foods. Young and Greene (1953) 
ran some tests between solid crystalline 
sucrose and a solution of 36% sucrose, 
They found that the solution was pre- 

ferred by rats to the solid sucrose. We 
did not use the up-and-down method 
at the time, but there can be no doubt 
that preference tests between solids and 
liquids can easily be made. 

Because rats prefer the sweeter solu- 
tion to the less sweet, we must not 
jump to the conclusion that a sucrose 
scale is suitable for hedonic ratings with 
human subjects. Jerry W. Koppman, 
in an unpublished study under the di- 
rection of W. E. Kappauf, found that 


some human subjects (like the rats) 
consistently preferred the sweeter solu- 
tion, except that 64 CHO (a highly 
viscous, sticky syrup) was the least 
acceptable fluid for all subjects. Other 
subjects tended to prefer the less-sweet 
solution especially when concentrations 
were above about 10 CHO. For the 
group as a whole 16 CHO v the 
most acceptable concentration, but t here 
were individual differences in the opti- 
mum. Koppman concluded that the 
up-and-down method with a CHO 
Scale is not suitable for palatability rat- 
Ings with human subjects. 


Tsohedons 


Guilford (1939) proposed that the 
term isohedon be employed to desig- 
nate a contour line of equal affective 
value on a stimulus surface such as à 
triangular surface with white, black. 
and yellow at the corners. 


The term 
was coined by an 


alogy to contour lines 
of equal barometric pressure (isobars) 
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on weather maps and the like. Subse- 
quently Guilford (1954) published a 
family of isohedons plotted on a surface 
defined by the frequency and intensity 
of auditory stimuli. 

With gustatory stimuli a stimulus 
area can be defined by specifying the 
range of concentrations for two solutes, 
both of which are present within a 
compound solution. .V stimulus area 
represents an infinite number of com- 
pound solutions containing two solutes. 
If three or more solutes were present 
in a compound solution, a tridimen- 
sional or multidimensional space would 
be necessary to represent all possible 
compounds. To date, however, we 
have worked only with mixtures con- 
taining two solutes. "uu 

The following general equation 1s 
used in plotting an isohedon, for ex- 
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sucrose- 


ample, in the CHO-NaCl stimulus 
area: 


h 
K CHO=X CHO+C NaCl 


in which K and C 
throughout a particular test, and X is 
the mean concentration that makes the 
compound solution and the sucrose 
standard isohedonic, For example, in 
determining a point on the 4 CHO iso- 
hedon at the level of 1 NaCl we would 
hold to these values in a test: 4 CHO 
versus X CHO + 1 NaCl. From 4to6 
values of N, varied up and down from 
trial to trial, are ordinarily used in a 
test. These values are separated by 
equal log distances and are determined 
in advance with a group of pilot 
animals. 

Figure 2 is an isohedonic contour 
map in the CHO-Q stimulus area, It 
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illustrates features common to isohe- 
donic maps in other stimulus areas: 
(a) Most but not all isohedons dip 
below the level of the standard CHO 
concentration (shown in Figure 2 by 
heavy numbers at the left) and also 
rise above the standard level, In this 
particular study the dips are indicated 
by + signs to show that low concen- 
trations of Q enhance the palatability 
of the mixture, facilitating ingestion, 
and the tails of the isohedons are indi- 
cated by — signs to show that higher 
concentrations of Q impair palatability, 
inhibiting ingestion. The + and — 
signs imply that there is a bipolar con- 
tinuum extending from positive to 
negative hedonic values. And, further, 
the signs show that it is possible to 
study the combination of solutes with 
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positive and negative valences. (b) In 
the four stimulus areas that have been 
mapped to date (CITO-NaCl, CHO-O, 
CHO-H, CHO-Sac), low concentra- 
tions of the nonsucrose variable have a 
positive or indifferent hedonic influ- 
ence, facilitating ingestion, and high 
concentrations have a negative hedonic 
influence, inhibiting ingestion. (c) 
The isohedons, in general, tend to run 
similar and roughly parallel courses. 
They do not intersect each other. This 
is exactly what we might expect if the 
curves in a family represent genuine 
isohedons, since the higher CIHO con- 
centrations are known to be preferred 
by rats to the lower. (d) Isohedons 


leave the stimulus area at relatively 
high concentrations of the nonsucrose 
variable. 


This fact implies that there 


ISOHEDON | cno] 
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CHO (right) in the sucrose-tartaric acid stimulus na qad part of Isohedon 1 


Schulte, 1963.) 
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is a limiting concentration of the non- 
sucrose variable at which no amount 
of sugar in the compound solution can 
make it acceptable. Rough and pre- 
liminary approximations of the upper 
rejection limits for need-free rats are 


6-8 NaCl, .3-.4 Q, 3-4 H, 3 Sac. 
The Locus of Hedonic Indifference 


If an isohedon dips below the stand- 
ard level at low concentrations and 
rises above the standard level at higher 
concentrations, there is implied an in- 
different concentration which neither 
enhances nor impairs palatability but 
which has a neutral influence. At the 
point of indifference the standard and 
comparison solutions are isohedonic 
but of different sensory quality. 

In a preliminary survey of the 
sucrose-tartaric acid stimulus area 
Young and Schulte (1963) determined 
concentrations of hedonic indifference 
at five CHO levels. The result is pre- 
sented in Figure 3. Each indifferent 
concentration Was determined by the 

ersus K CHO + X H. 


test: K CHO v 
Hedonic equality is represented by the 


general equation: 


K CHO=K CHO+XH 


in which K is a sucrose concentration 
held constant throughout a test and 
equal in the standard and comparison 
solutions, and X is the mean acid con- 
centration that makes the comparison 


solution hedonically equal to the 

standard. = i 
The horizontal arrows indicate the 
ations used in 


range of acid concentr ; 
each test. For each range all subjects 
preferred the compound (tart) solu- 
tion to the standard with concentra- 
tions shown at the extreme left; all 
subjects preferred standard to com- 
pound (sour) solution with concentra- 
tions at the extreme right. The line 
connecting the mean indifferent con- 


centrations divides the area into two 
regions. To the left of the locus of 
indifference the acid concentrations en- 
hance palatability; to the right of the 
locus of indifference the higher acid 
concentrations impair palatability. The 
line itself represents the locus of com- 
pound solutions in which tartaric acid 
has a neutral hedonic influence. 
Every point upon the locus of in- 
difference is also a point upon an iso- 
hedon with a standard at the same 
CHO level. This is illustrated by the 
curve at the right, Figure 3, which is 
part of Isohedon 1 CHO. The points 
on this isohedon were determined by 
solving for X in this test: 1 CHO ver- 
sus K CHO+X H. The general 


equation is: 


h = 
1 CHO=K CHO+XH 


in which K is the concentration used to 
determine a point on the locus of in- 
difference, and X is the mean concen- 
tration of H at hedonic equality. This 
is part of Isohedon 1 CHO because it 
does not extend into the region where 
low concentrations presumably would 


enhance palatability. 


Hedonic Summation and Interaction 


If hedonic processes summate alge- 
braically, the addition of NaCl to a 
CHO solution should enhance palata- 
bility when the NaCl concentration is 
in the range of acceptance and impair 
palatability when the NaCl concentra- 
tion is in the range of rejection, Such 
algebraic summation does, in fact, oc- 
cur. Young and Christensen (1962) 
found that the addition of 75 NaCl 
(near optimal) to a sucrose solution of 
5, 1, or 2 CHO made the compound 
preferred to a simple sucrose standard 
of the same concentration. And the 
ds were also preferred to a 
ard of .75 NaCl. The im- 
palatability by addition of 


compoun 
simple stand: 


pairment of 


68 


4 NaCl (hedonically negative) was 
easily demonstrated. u . 
The summation of positive hedonic 
intensities did not appear with concen- 
trations of 4 CHO and 8 CIIO, but 
subsequently with another method 
Christensen (1962) demonstrated he- 
donic summation at the higher CHO 
levels. He attributed the success of 
the second experiment to an improve- 
ment of method. Specifically, the im- 
provement provided: (a) forced pre- 
liminary sampling of both fluids, (b) 
utilization of spatial cues as aids to 
discrimination, (c) the up-and-down 
psychophysical method. 

Algebraic summation of positive he- 
donic intensities has also been demon- 
strated in the CHO-Sac stimulus area. 
Young and Madsen (1963) found that 
CHO-Sac isohedons made a much 
bigger dip when .5 Sac was present 
than isohedons in the CHO-NacCI stim- 
ulus area. This phenomenon is under- 
Standable as simple algebraic summa- 
tion since the optimal Sac concentra- 
tion is equivalent to 3.5 CHO (see 
Figure 1) while the optimal NaCl con- 
centration is equal to approximately 
1 CHO. 

The concept of algebraic summation 
breaks down, however, in the CHO-Q 
stimulus area, Kappauf et al, (1963) 
pointed out that neither their own work 
nor the experiments of Patton and 
Ruch (1944), Benjamin (1955), and 
Young, Burright, and Tromater (1963) 
found any evidence that simple quinine 
solutions at any concentration have 
positive hedonic value, All quinine 
solutions are rejected at threshold and 
higher concentrations. Nevertheless, 
low concentrations of Q combined with 

high concentrations of CHO produce 
mixtures that are preferred to simple 
CHO solutions of equal concentration 
(see Figure 2). The hypothesis of 
algebraic summation would require that 
the addition of a solute with negative 
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valence would impair rather than en- 
hance palatability. Consequently we 
must postulate that the addition of O0 
to CHO produces an interaction that 
differs from simple algebraic summa- 
tion. 

The nature and bodily locus of this 
interaction are unknown. Interaction 
could occur on the tongue in the pres- 
ence of saliva: The ions of a compound 
solution could have interrelated stimu- 
lating effects upon the gustatory re- 
ceptors and thus influence the total 
nerve discharge. Or interaction could 
occur in the nervous system at the 
midbrain level where primary affective 
processes appear to originate. The 
where and how of hedonic interactions 
and their relation to simple algebraic 
summation are matters to be investi- 
gated in further experiments, 


Diagnostic Tests of Individual Differ- 
ences in Affective Responses 

ligure 4 shows four 
the CHO-NaCI stimulu 
standard of 1 CHO, 
hedon, plotted with 
based upon the data 
Schulte (1963) for 
tested with the group up-and-down 
method, The other isohedons, plotted 
with solid lines, show curves based on 
tests of Young and Madsen 


isohedons in 
5 area with a 
The lowest iso- 
broken lines, is 
of Young and 
a group of 30 rats 


: i (1963) 
with eight rats tested with the indi- 
vidual up-and-down method, — The 


heavy line is the mean isohedon for 
the eight rats, the light lines show 
isohedons for two individuals: Rat 3 
and Rat 8, 

It will be observed that the mean 
isohedon for the rats tested individually 
is higher in the field than the isohedon 
based on data obtained with the group 
method. Just why this difference ex- 
ists Is not clear, but each point on an 
individual isohedon is based upon 75 
choices Per rat (25 per day for 3 days), 
while each point on the group isohedon 
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sohedons in the sucrose-sodium chloride stimulus arca. 


Fic. 4. Group and individual i 
(Fro 


1s based upon one choice per rat on à 
single day. I believe that the greater 
amount of experience in the individual 
tests accounts for the difference be- 
tween the two curves, but further study 
of this matter is needed. 
More interesting is the ¢ 
of curves for individual subjects. 
isohedon for Rat 3 is at all 
higher in the field than that for Rat 8. 


Also the curve for Rat 3 does not dip 
significantly below the 1 CHO standard 
level while the curve for Rat 8 shows 
the typical dip. Moreover, we found 
that in isohedons with standards of 4 
CHO and 16 CHO, Rat 3 was con- 
Sistently the “high” animal and Rat 8 
Consistently the “low” animal. There 


lisplacement 
The 


points 


m Young & Madsen, 1963.) 


is an obvious parallel displacement of 
aedons for individual rats. There 
is, of course, overlap, but curves for 
some subjects are consistently above 
in, for others, consistently below 
for others, consistently 


isol 


the mea 
the mean, 

average. 
This finding of parallel displacement 
ual isohedons is theoretically 
and practically important. Tt implies 
lons reveal consistent indi- 


that isohec 
vidual differences in affective responses 


when tested at any or all points within 
stimulus area. Practically this 
that in developing diagnostic 
dividual differences in affec- 
Jo not need to plot total iso- 
ndividual differences can be 


of individ 


the 
means 
tests of in 
tivity we ¢ 
hedons. I 
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tested at a single point within the stim- 
ulus area. Or (even better) diagnostic 
tests can be made by getting hedonic 
ratings of simple or compound solu- 
tions with a CHO scale. For example, 
in studying conditions that change the 
level of acceptability of NaCl one might 
obtain ratings of 1 NaCl on a sucrose 
scale, or perhaps ratings of a com- 
pound such as 1 NaCl + 4 CHO, and 
observe how these ratings changed with 
adrenalectomy, sodium depletion, and 
other conditions. 

The interpretation of individual dif- 
ferences in tests of affective responses 
is another matter. The difficulty can 
be illustrated by reference to Figure 4. 
One can argue that Rat 3 is less sen- 
sitive than Rat 8 to sucrose stimula- 
tions since he requires more CHO in 
the compound to produce hedonic 
equality with the standard; Rat 3 has 
less of a “sweet tooth" than Rat 8. 
But one can equally well argue that the 
difference lies in the affective response 
to NaCl. Rat 3 may react against 
NaCl more strongly than Rat 8 and 
hence require more CHO in the com- 
pound to counteract a built-in aver- 
siveness, I am sure that this difficulty 
of interpretation can be overcome by 
further work and that diagnostic tests 
of individual differences in affective re- 
sponses are entirely feasible. 


Preference 
Tongue 


Young and Schulte (1963) demon- 
strated that a distinction must be 
drawn between preference and activa- 
tion of the tongue. Their data re- 
vealed two interesting facts: First, the 
amount of licking a physically constant 
standard solution varied with the com- 
parison solution that happened to be 
paired with it during a preference test. 
If the comparison solution was of high 
palatability, the standard solution was 
licked at a higher rate (licks/minute) 


and Activation of the 


than if the comparison solution was of 
lower palatability. There was thus a 
gestalt effect that influenced intake. 
Second, the total amount of licking 
both fluids (standard plus comparison) 
varied with the location of the com- 
parison solution in the stimulus area. 
A striking illustration of this was 
found in the CHO-Q area. With a 
standard of 1 CHO the mean numbers 
of licks of both fluids (standard plus 
comparison) are shown in the follow- 
ing tabulation: 


Concentration of Q in the 
comparison compound 


.00001 .0001 .001 01 4 


Mean number of licks with 
both fluids 


207 162 141 113 47 


It is obvious at a glance that the com- 
pound solutions represented by points 
along an isohedon are not equal in 
their potential for activating the lick- 
ing response, 

Preference is a phenomenon which 
can occur at any level of tongue activa. 
tion. Viewed from this angle, an iso- 
hedon appears to be a series of points. 
Each point represents hedonic equality 
between a physically constant standard 
and a comparison solution tested at a 
fixed location within a stimulus area. 


Activation with 
Stimuli 


Single Gustatory 


To study tongue activation further 
Young and Trafton (1964a) recorded 
rates of licking single solutions ( simple 
and compound) exposed for two suc- 
cessive 60-second periods. They ex- 
plored the same four stimulus areas 
that had Previously been mapped with 
isohedons so that activity data with 
single stimuli could be compared di- 
rectly with preferences elicited by 
paired stimuli, Samples of their find- 
ings are presented in Figures 5 and 6. 
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the sucrose-sodium chloride stimulus area. 


ats. From Young & Trafton, 1964a.) 

Fine vertical lines give rates of licking 
at the 20 points where tests were made. 
The solid lines describe a tridimen- 
sional surface based upon mean rates 
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of licking during the first 60-second 
exposure. Broken lines show rates of 
licking during a second 60-second ex- 
posure that followed after a pause of 
30 seconds or more. : 

Figure 6 is an activity contour map 
of the CHO-Q stimulus area. Num- 
bers on the contour lines indicate mean 
numbers of licks per 60 seconds during 
the first exposure, Each line on the 
surface is an isoacton—a curve repre- 
senting simple and compound solutions 
that elicit the same amount of licking. 

The following generalizations are 
based upon a study of tridimensional 
activity surfaces and activity contour 
maps in four stimulus areas: (a) The 
highest rates of licking occur where a 
sucrose concentration of 8 CHO is 

present alone or in combination with 
a relatively low concentration of the 
nonsucrose component. The lowest 
rates of licking are found where the 
lowest sucrose concentration (1 CHO) 
is combined with the highest nonsu- 
crose concentration. (b) In all four 
stimulus areas where the concentration 
of the nonsucrose component is in the 
region of negative affectivity (as de- 
fined by isohedons and by the loci of 
indifference) it is possible to find cer- 
tain concentrations of the nonsucrose 
component which yield rates of licking 
having a direct monotonic relation to 
CHO concentrations in the compound 
solutions, For example, observe in 
Figure 5 the relative rates of licking 
compounds that contain 4 NaCl. (c) 
In the CHO-NaCl and CHO-Sac stim- 
ulus areas (but not in the CHO-O and 
CHO-H areas) the highest concentra- 
tions of the nonsucrose component have 
a fairly uniform inhibitory influence 
upon the licking response at all levels 
of CHO concentration. In Figure 5, 
for example, compare the activity gra- 


! The term isoactom was coined by Young 
and Trafton (1964b) to match Guilford’s 
(1939) term isohedon. 
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dients at 8 NaCl and 4 NaCl. (d) In 
the CHO-NaCl and CHO-Sac stimu- 
lus areas there is an indication of alge- 
braic summation of positive hedonic 
intensities at the level of 1 CHO but 
not at higher CHO concentrations. 
Figure 5, for example, shows a higher 
rate of licking at 1 CHO +1 NaCl 
than at 1 CHO +O NaCl. (e) In 
most tests the mean rate of licking is 
lower during the second 60-second ex- 
posure than during the first, but in the 
CHO-NaCI area there are a few in- 
stances of a “warming-up” effect, that 
is, slightly higher activity during the 
second exposure than during the first. 
This effect appears where relatively 
low NaCl concentrations combine with 
relatively high CHO concentrations, 


Isohedons and Isoactons 


Isohedonic contour maps and activity 
contour maps are totally different kinds 
of graphic representations, It is im- 
possible to predict the course of iso- 
hedons from isoactons 
Some points of 
below. 


and vice versa. 
divergence are noted 


From the activity charts one cannot 
predict the uniform preference of 32 
CHO and 8 CHO. On the basis of 
amount of licking one would predict 
little difference in acceptability between 
these two fluids or that 8 CHO would 
be preferred to 32 CHO. Young and 
Trafton (19642) found that the rate 
of licking 8 CHO was slightly and 
consistently higher than the rate of 
licking 32 CHO, thus confirming 
Young and Greene (1953), Moreover, 
the difference in rate of licking cannot 
be attributed to the approach of satia- 
tion since the difference in rate was 
observed with initial exposure times as 
brief as 10 seconds, 

Again, all activity contour charts 
(except that for CHO-NaCl) reveal 
secondary peaks of high activity in 
places that have no relation to the iso- 
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hedons. This point is illustrated in 
Figure 6 which should be compared 
with the isohedonic contour map of 
the same area shown in Figure 2. The 
isohedons indicate a maximal enhance- 
ment of palatability at .00003 O + 40.3 
CHO, but the maximal rate of licking 
is located at about .00001 Q + 8 CHO. 
Further, the isohedons give no indica- 
tion of the secondary peak of activity 
shown in Figure 6 at .001 Q + 8 CHO. 
‘Therefore, as noted above, preferential 
discrimination is one thing. activation 
of the tongue (intake) is something 
elsc. 

It is scarcely necessary to add that 
the activity charts give no objective 
basis for distinguishing between. posi- 
tive and negative valences, but this dis- 
tinction is clearly indicated by isohe- 
dons and the loci of hedonic indiffer- 
ence where there is a standard solution. 
Activity charts do show that increas- 
ingly high concentrations of the non- 
sucrose variable inhibit or retard lick- 
ing (ingestion) to the point of complete 
rejection of the compound, but there 1s 
no clear line of division between facili- 
tating and inhibiting influences. 


‘ities and Preference 


It is easy to assume (as I and others 
have erroneously done) that the most 
palatable food is also the best aie 
for eliciting instrumental activ ities. 


d :on is tr vithin 
This assumption is truce only W 
P 1 deal of con- 


limits, and there is a 00€ d 
tradictory evidence: Ina Ley 9e 
experiment Guttman (1953) foun tha 
with continuous reinforcement the per- 
formance level was higher at 16 
than 32 CHO. Young and Shuford 
(1955) found that the rats’ speed of 
locomotion down a 6-foot runway Was 
faster when the subjects approached an 
e of 18 CHO than one of 54 
plenty of evidence 


CHO con- 


Instrumental Acti 


incentiv I 
CHO. Yet there 15 p. 
that rats prefer the higher 


x 
w 


centration to the lower when offered 
a choice. 

In an early experiment (Young, 
1944) rats shuttled back and forth on 
the preference tester with an incentive 
of crystalline sucrose for one group 
and whole wheat powder for the other. 
There were large individual differences 
in running time, and performance of 
the groups overlapped so greatly that: 


The safest conclusion to draw is that present 
results do not afford a sound basis for any 
prediction of preference between sugar and 
wheat. The activity of running is one thing 
and food preference, as tested by our ap- 
paratus, is something else [p. 339]. 


In another experiment, Young, Falk, 
and Kappauf (1958) measured the 
time required by rats to approach a 
cup. 38 centimeters away from the 
starting box, which contained a solu- 
tion of .1, .3, .9, 27, or 8.1 NaCl. 
They found that the maximal rate of 
approach was to the 2.7 NaCl solution. 
Preference tests had previously shown 
that optimal palatability was close to 
9 NaCl. Hence rate of approach does 
not correspond exactly with palatability. 

In view of these and similar findings 
one should not expect to predict pref- 
erences infallibly from the performance 
of instrumental activities such as run- 
ning down a straightaway, bar press- 
ing, T-maze performance, etc., and vice 


versa. 
STUDIES OF THRESHOLDS 
Sensory and Preferential Thresholds 


Richter (1939). using a continuous- 
technique. reported 


exposure preference 
shold of rats for 


that the taste thre 
sodium chloride was lowered by ad- 
renalectomy. Ie believed that the 
1 for NaCl resulting from this op- 


neec nidos = 
sensitivity of gus- 


eration increased the 


tatory receptors. mans 
Richter’s (1939) claim instigated a 


number of experimental studies. One 
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Young and Shuford suggested that a 
minimal hedonic intensity is necessary 
for learning (improvement with repe- 
tition) to occur. The matter, obviously, 
needs further study. 


PREFERENCE AND THE REGULATION 
or FooD INTAKE 


Regulation of Food Intake by Organic 
Need States and Drives 


Deprivation of a substance necessary 
to maintain homeostasis frequently (but 
not invariably) develops a specific hun- 
ger, or appetite, for the needed sub- 
stance. For example, if rats are main- 
tained upon a dict deficient in protein, 
casein is preferred to sugar; but need- 
free rats prefer sugar (Young, 1941, 
1948). Again, Weiner and Stellar 
(1951) and Epstein and Stellar (1955) 
have argued that the need for the Na 
ion in adrenalectomized rats is the 
major determinant of increased NaCl 
intake. 

Recently Lewis (1960, 1964) showed 
that appetitive drives produced by 
adrenalectomy and parathyroidectomy 
have all the characteristics of general 
hunger and thirst drives, When ad- 
renalectomized rats were deprived of 
NaCl, they learned habits of bar press- 
ing and T-maze discrimination for re- 
wards of NaCl solutions. The strength 
of response varied with the duration 
of NaCl deprivation. The subjects also 
discriminated between solid food and 
NaCl solutions on the basis of their 
need state. 

A specific drive, or appetite, for cal- 
cium was produced by parathyroid- 
ectomy. This drive had all the char- 
acteristics of other appetitive drives. 
The calcium drive led to formation of 
instrumental habits that yielded a re- 
ward of calcium lactate. The strength 
of the calcium drive varied with the 
number of hours of calcium depriva- 
tion, and the depleted subjects showed 


a preference for concentrations that 
yielded more calcium. 

Hence, Lewis concluded, there are 
specific hunger drives for sodium and 
calcium, and intake is regulated by 
metabolic needs. 


Optimal Palatability and Excessive 
Intake 


A difficult problem for the homeo- 
static theory of regulation is to ac- 
count for excessive intake that occurs 
when NaCl solutions are presented at 
optimal concentration. Nelson (1947) 
reported that when rats had access to 
a solution of .8 NaCl they drank great 
quantities of it. Growth was retarded, 
the kidneys and other organs became 
enlarged, and there were other symp- 
toms of excessive NaCl intake. The 
rats took much more NaCl than needed 
to maintain homeostasis and to remain 
in sound health when sodium chloride 
was obtainable at a concentration of 


8 NaCl. 


This finding has been confirmed by 
Schmidt (1964) with rats injected 
with 40 milligrams/kilogram pheno- 
barbital and tested for NaC] acceptance 
with a one-bottle technique. He found 
a peak of acceptability with injected 
animals and also with the placebo con- 
trols when the concentration was 9 
NaCl. The injected rats: 


drink so much of the solution that they 
overload the stomach and gut and some oí 
the solution flows out the anus, These rats 
seem literally unable to withdraw from the 
drinking tube. Moreover, the anal outflow 
of solution scems unpleasant to the rat in 
that drinking is depressed for a couple of 
days thereafter. Whether it would occur 
after repeated exposures is not known at 
this time [p. 204]. 


Schmidt's (1964) graph clearly 
shows that the intake of NaCl solutions 
is a function of concentration; his ac- 
ceptance-rejection curve is similar to 
that established by previous workers 
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' (see Young & Falk, 1956). Since 
NaCl solutions at optimal concentra- 
tion lead to excessive intake of salt and 
with phenobarhiti injections, to exces- 
sive drinking, it is clear that internal 
homeostatic mechanisms do not fully 
account for the regulation of intake. 
Intake and taste preference are regu- 
lated by different, though interrelated, 
bodily mechanisms. 


Aversive Behavior and Satiation 

A distinction must be drawn between 
(a) aversive behavior based upon palat- 
ability, and (b) the checking of inges- 
tion by internal mechanisms as satia- 


tion is approached. 
Satiation is a terminal orga 
characterized by the fact that 
Ject, in a constant environment, NO 
longer accepts the kind of foo 
upon which he is satiated. Need-free 
rats are sometimes incorrectly de- 
scribed as “satiated.” If such subjects 
are offered a solution of sucrose sac- 
charin, or sodium chloride (which they 
do not need), they ingest quantities of 
it approaching the limit of satiation for 
the particular test food. There are 
states of partial and total satiation. In 
Partial satiation an animal satiated upon 
one kind of food will still accept an- 
other, the subject 
Will not accept any kind of 
liquid, li 
n Distinet from the appro 
ton is the immediate 


nic state 
the sub- 


d or drink 


fa 
bad taste. 


example, if rats are offered a 5 


of .1 Q, they do not drin 


away as soon as contact has beer p 
If they are offered 4 


With the fluid. 

Compound containing Q and bere A 

example, .] Q + 32 ČHO, they maj 

accept it, but with repeated experienc? 

they develop an aversion. — e 
Rats learn to like and dislike, to ? : 

cept and reject, to discriminate Ph 


*rentially, and this lear? 
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by the immediate hedonic effect of 
sensory stimulation. 


rIVE PROCESSES 


Pirysto.ocy or AF 


Receptor stimulation has three main 
physiological effects. The first is sen- 
sory. The classical projection sys- 
tems receiving inputs from various 
receptors mediate cognitive functions 
such as localization, discrimination, 
perception, memory, and thought. The 
second effect of receptor stimulation is 
activating. The reticular activating 
n is aroused through collaterals 


systet 
afferent nerves as well as by in- 


from 
puts from the cerebral cortex. The 
third effect is hedonic. Some arousals 


stimulation are positive 
and facilitating, others are negative and 
inhibiting, still others are neutral. 
There are built-in neural mechanisms 
that mediate primary evaluations in 
terms of biological "good" or "bad." 


bv receptor 


Septal Stimulation and Affective 
Processes 

Asdourian (1962) was the first, so 
far as I know, to combine an approach 
through experimental hedonism with 
the selí-stimulation technique of James 
Olds. He attempted to discover 
whether affective arousals produced 
by taste stimuli could be combined 
with self-stimulations of the septal area 
of the limbic lobe to influence choice. 
Specifically he asked: Can selí- 
stimulation through electrodes im- 
planted in the septal area of the limbic 
system make quinine solutions ac- 
ceptable? Can normal preferences 
among sucrose and quinine solutions be 
modified by brain shock ? . 

He arranged an apparatus in which 


ts were tested with single-taste 
» ids or with pairs of fluids presented 
D watus was so de- 


The appé as so 
contacts with either 


Jd produce a brain 


for choice. 
signed that tongue 
E h fluids cou 


or bot 
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shock through electrodes implanted 
within the septal area of the rat's brain. 
Prior to tests with taste solutions, 
Asdourian found that of 26 naive rats, 
12 responded actively by bar pressing 
that produced septal stimulation, 14 
responded so little that they were called 
“nonresponders” even though they re- 
ceived septal stimulation from bar 

oressing. 
i a found that when tongue 
contact with a fluid produced brain 
shock the rate of licking was reduced 
for all rats from normal (6-8 licks/ 
second) to 2-3 licks/second. Septal 
stimulation, therefore, retarded licking 
and ingestion, but it did not inhibit 
tongue contact with the fluid that pro- 
duced septal stimulation, The be- 
havior of the subjects gave the impres- 
sion of “bar pressing with the tongue” 
to produce septal stimulation! For 
example, when the rats were offered 
a choice between 4 CHO without brain 
shock and 8, 16, or 32 CHO plus brain 
cae be ingested more of the 4 
bras dh ution, presumably because the 
3 ock inhibited ingestion of the 


concentrations rather 


the fluids that pr 
tion but consum: 
might say that t 
septal stimulation to drinking, 


at normally 
regulate taste preferences. To be 


specific: When brain shock was re- 
ceived from contact with both fluids 
presented together for choice, the “re- 
sponders” showed a normal taste pref- 
erence of CHO + shock to Q + shock, 
and the "nonresponders" showed a di- 
rect monotonic relation between the 
numbers of tongue contacts and CHO 
concentrations of 4, 8, 16, and 32 CHO 
(see Asdourian's Figure 3). Hence 


there is evidence that septal stimula- 
tions retarded the rate of licking and 
inhibited ingestion but did not dis- 
turb the mechanisms that 
regulate taste preferences. 

One subject in the group of “re- 
sponders” differed from the others in 
that he was totally indifferent to 
palatability relations; normal pref- 
erences did not influence behavior. 
When brain shock could be obtained 
by contact with either fluid, this sub- 
ject divided contacts evenly between 
them regardless of palatability thus 
indicating no preferential discrimi- 
nation. 

Taken as a whole, Asdourian’s data 
indicate that septal stimulation pro- 
motes contact with fluids that produce 
such stimulation but retards normal 
licking and hence ingestion. One result 
is certain: We cannot regard septal 
stimulation as a kind of general 
pleasantness that can be combined with 
hedonic effects from tasting quinine, 
sucrose, or distilled water to change 
the level of acceptability, Apparently 
septal stimulation does not affect the 
neural mechanisms that normally regu- 
late preferences. We conclude that the 
neural mechanisms that regulate taste 
preferences are not located within the 
septal region of the limbic system and 
that further research will be necessary 
to determine where these mechanisms 
are located and how it happens that 
Sweet-tasting fluids normally facilitate, 
and bitter-tasting fluids normally in- 
hibit, ingestion. 


normally 


Neural Mechanisms of Appraisal 


Arnold (1960) has given a compre- 
hensive definition of emotion which in- 
cludes complex states such as fear, 
rage, and lust and also simple feelings 
of Pleasantness and unpleasantness 
arising from sensory stimulations. 
Basing her theories on the research 
of Pribram, MacLean, Papez, and 
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Fig, 7. Acquisition of licking 
water. (Plotte 


“estimative” 
es the cere- 
mechan- 


others, she describes an 
neural system which involv 
bral cortex and subcortical 
isms. This "estimative" system func- 
tions in the appraisal of complex stim- 
ulus situations prior to emotional re- 
sponses. It functions also in the ap- 
praisal of simple sensory stimulations. 


Arnold regards feelings of pleasant- 
hess and unpleasantness as rudimen- 
s neural 


tary appraisals which occur a 
impulses relayed from receptors to the 
midline and intralaminar thalamic 
nuclei, It is a matter of theoretical 
importance that rudimentary appraisals 


of sensory stimulations are made by 
terms of 


built-in neural systems in 

biological "good" or “pad.” Further 
study of these systems and their anal- 
ysis at the physiological level is a 
matter that merits the highest priority. 


vE PROCESSES 


Tuer Rote or AFFECTI 
ING 


iN MOTIVATION AND LEARN 
Acquisition and Extinction of Licking 
Response to Solutions of Different 
Hedonic Intensities 

The relation between acquisition and 
extinction of the licking response to 
sucrose solutions that differ in com 


EXTINCTION 


response to four sucrose solutions and extinction with distilled 
d from unpublished data by Young & Trafton.) 


centration (hedonic intensity) was ex- 
plored in an unpublished study by 
Young and Trafton. The rats, after 
preliminary exploration of the testing 
box, were offered a sucrose solution 
delivered through a nozzle for 60 sec- 
onds. The number of licks of the 
fluid during the exposure was counted 
electrically. There were four groups 
of subjects, each composed of eight 
naive, need-free rats. The subjects in 
one group were rewarded with 32 
CHO; in another group, 8 CHO; in 
other, 4 CHO; in another, 1 CHO. 
Results are presented in Figure ie 
This figure shows that during the first 
5 days (apart from two minor inver- 
sions on Day 3) the mean numbers of 
licks per minute were directly propor- 
tional to the CHO concentrations. 
Since the level of performance is pro- 
1 to known hedonic intensities, 
this will be called the hedonic in- 
tensity pattern. After the first 5 days, 
however, the curves for Groups 4 
CHO, 8 CHO, and 32 CHO cross and 
recross, and there are no significant 
differences among them. On Days 10 
and 11 the amount of licking was 


slightly (not significantly) greater with 


an 


portiona 
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the 8 CHO group than with the: 64 
- » The rats in the 1 CHO 
CHO Bron ently and significantly 
group consistently i RD e) 
licked the fluid at the slowes z » 
Since the rates of licking daring oed 
611 agree with what is known a os 
activation of the tongue by CHO solu- 
tions (see above discussion), this will 
be called the habituated activation 
patlern. Learning is marked by a 
transition from the hedonic intensity 
pattern to the habituated activation 
pattern. The influence of hedonic in- 
tensity in organizing behavior is thus 
shown most clearly during the early 
stages of learning. 
After Day 11, all rats were offered 
an incentive of distilled water for 5 
days in the same nozzle. For all 
groups the substitution of distilled 
water for a sucrose solution brought 
an immediate drop in the amount of 
licking. During the 5 days of extinc- 
tion, the 1 CHO group showed the 
least change and the 32 CHO group, 
the greatest change in amount of lick- 
ing. On the fifth day of extinction the 
amounts of licking were inversely pro- 
portional to those of the hedonic-in- 


tensity pattern. It is a point of theo- 
retical interest that during extinction 
the amounts of ] 


icking are the inverse 
of those in the hedonic-intensity pattern 


rather than the habituated-activation 
pattern. The explanation, I think, 
must refer to the acquired disposition 
which is organized on the basis of 
hedonic processes. 

This finding agrees with results of 
various other experimenters. For ex- 
ample, Guttman (1953 ) found a similar 
inversion of the hedonic-intensity pat- 
tern during extinction of bar pressing 
when the sucrose reward was removed. 
The biggest drop in rate of bar press- 
ing occurred with rats that had been 
trained on the highest CHO concentra- 
tion and the least drop, with rats that 
had been trained on the lowest con- 
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centration. Again, Young and Shu- 
ford (1955) observed a similar inver- 
sion of the hedonic-intensity pattern 
during extinction of the habit of run- 
ning down a 6-foot straightaway for a 
daily 1-second sip of sucrose solution. 
Thus the hedonic-intensity pattern is 
inverted (after learning) when the re- 
ward is removed. 


Learning Preferential Discriminations 
with Paired Stimuli 


In an carly experiment (Young, 
1933) abrupt shifts of preference were 
observed when one test food was sud- 
denly substituted for another. The 
abrupt shifts were commonly followed 
by preferential trends which occurred 
more or less gradually with repeated 
choices. In subsequent eta 
abrupt mutations of preference and 
more gradual trends have been “3 
peatedly observed and studied. A ny 
erential trend indicates [earning a pre 1 
erential discrimination with repeatec 
choices. anil 

To illustrate preferential trends uin 
shifts I have plotted some data (see 
Figure 8) from a study by Young 
and Christensen (1962). The curves 
show percent choices of a compound 
solution made by a group of eight 
need-free rats tested on the brief-ex- 
posure preference tester, The subjects 
were offered a series of choices between 
a simple sucrose standard of 8 CHO or 
4 CHO anda compound containing the 
Same concentration of sucrose as the 


Standard plus 2 NaCl or 4 NaCl. In 


cach series of tri 
NaCl in the co 
increased from 

The curves c 
in terms of kn 
We know that 
4 CHO. We 
2 NaCl solutio: 
rejection not f. 


als the concentration of 
mpound was suddenly 
2 NaCl to 4 NaCl. 

an be readily interpreted 
own hedonic principles. 
rats prefer 8 CHO to 
also know that a simple 
n is within the range of 


ar above the indifferent 
cencentration and that 4 NaCl is more 
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equal 10096 minus the values plotted. From 
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data by E the simple sucrose standard, 
oung & Christensen, 1962. 


intense 
nsely negative. Now the first 


Cury 
the 50 at the left is only slightly above 
of 2 X (chance) level. The presence 
not ees in the mixture gave à slig 
When "4 nificant) rise in the curve, but 
Snly in NaCl concentration was sud- 
ac] oe from aCl to 
ecline here was a marked and steady 
“ompou in the percent choices of 
Moved nd, and the preferentia 
teji a the limit 
Plete 2 of the compound an 
standard PE of the simple : 
he addi 7 The third curve shows tha! 
dupes chen of Z NaCl © 4 CHO po 
Father a downward preferential trend 
Postitive IM Au indifferent OT 
appeared effect similar to 
nde at the 8 CHO level 
4 ceristandable in view of the fact that 
E is less acceptable than 
at a weak negative ;alence may 


and 
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n n 
facilitate acceptance wi 
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d, above, 


CHO-Q mixture). But when the 
NaCl concentration was increased from 
4 NaCl, there was an im- 
shift in the percent 
f the compound followed by 
e—a trend towards com- 

of the compound and 
f the simple CHO 


choices © 
a steady declin 


lete rejection 
complete acceptance o 
standard. 
These cur 
references. 
learn t 


indicate immediate 
o show that sub- 
o make preferential dis- 
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i nd moves, the steep- 

nor the abrupt shifts 
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SUCCESSIVE CYCLES OF TESTS 
Fic. 9, : Development and stability of a pref- 
erential hierarchy, (Percentages of choices of 


the preferred food are shown below a curve, 


of the nonpreferred food, above. Redrawn 
from data by Young, 1947.) 


Development and Stability 
erential Hierarchy 


of a Pref- 


When preference tests 
atically made with three 
foods, the subjects develo 
hierarchies which are r 
This came to light in an early experi- 
ment (Young, 1947 ). The study tested 
three foods of known palatability rating. 
Crystalline sucrose (S) is preferred 
by rats to whole wheat powder (W). 
The difference in palatability between 
these foods is small, but both are 
definitely more acceptable than casein 
(C). 

Series of counterbalanced preference 
tests were run with three pairs of test 
foods: S-C, W-C, S-W. Cycles of 
preference tests were planned so that 
no pair had any advantage of priority, 
and all single foods had an equal op- 


are system- 
or more test 
p preferential 
elatively stable, 


portunity to influence choice. The re- 
sults are reproduced in Figure 9. 

The curves in Figure 9 illustrate 
three important points. 'irst, a per- 
centage of choices indicates the relative 
difference in acceptability between a 
pair of test foods and not absolute 
hedonic intensity. The lowest curve 
gives the percentage of choices of the 
preferred food (S) when paired with 
W. S and W are high in hedonic in- 
tensity (under the diet of the experi- 
ment), but the difference between them 
is relatively small as shown by per- 
centages which are near the indiscrimi- 
nate (50%) level. In the sucrose- 
casein tests the percentages of choices 
of sucrose are consistently the highest 
of all, indicating that the difference in 
palatability between these foods is 
relatively large. Similarly, in the 
wheat-casein tests the percentages are 
almost as high as those in the sucrose- 
casein tests. This agrees with the fact 
that W is only slightly less palatable 
than S. 

Second, during the first four cycles 
all percentages of preference show a 
simultaneous tendency to rise. The 
learning of three preferential dis- 
criminations went on simultaneously, 
but the preferential hierarchy did not 
change. Preferential relations were 
the same from cycle to cycle. Learn- 
ing to discriminate did not modify the 
basic hedonic relations. 

Third, during the fifth cycle of tests 
the rat colony suffered an epizootic of 
respiratory disease. Several subjects 
died, others became slow and irregular 
in running, Figure 9 shows that dur- 
ing this epizootic the subjects were less 
discriminating than when healthy in 
the first four cycles, But despite the 
drops in all percentages and the slower 
running, the preferential hierarchy re- 
mained unchanged. 

In general, when rats are system- 
atically tested with three or more foods, 
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they develop a relatively stable pref- 
erential hierarchy or yalue system. 
lhis value system is based upon pri- 
mary hedonic processes which are in- 
dependent of learning. The acquired 
Value system may without 
change during sickness. 


persist 


GENERAL CONCLUSIONS 


P The experiments considered above 
Support the following general anette 
sions: 

1. The postulate that affective proc- 
esses have an objective existence 
viun the nervous system is confirmed 
?y a group of psychophysical experi- 
ments, With the up-and-down psy- 
chophysical method affective processes 
can be studied objectively and quan- 
Histively with remarkable precision. 
This method has made it possible to 
is give objective palatability ratings 
um s and compound solutions 
“Sing rats as subjects and a sucrose 
Scale, (b) plot isohedons in four 
ep stimulus areas that are de- 
ned by the concentrations of two 
solutes, (c) plot the locus of indiffer- 
nee and thus map regions within a 
qs area where solutes have posi- 

» neutral, and negative hedonic € 
bu (d) provide a method for ot 

Sing individual differences in a 
cavity, (e) provide norms of ac- 
s by and preference for need-free 
e bjects that can be used in 2 variety 
br em involving deprivation, Sies, 
and operation, implanted elec " 

other conditions. 
4. An important distinction has been 
p, n between activation of the tongue 
n Minus (intake) and id 
"der. The rate of licking iem 
of T. systematically with comp 
he taste solution. wh 


en rats 

em l when ks 

act oved as subjects. Tridimension? 
lVity surfaces and 


pidimension® 
Activ; i va 
ae contour maps fail to provi 
adequate basis for predic 


dr 
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preferences and vice versa. Also, 
instrumental acts motivated by food 
reward fail to provide a dependable 
basis for predicting preferences. Acti- 
yation is one thing; preference is some- 
thing else. 

3. Some fluids are immediately ac- 
cepted by the subject, others im- 
mediately rejected. If compound fluids 
that contain solutes of positive and 
negative valence are accepted, the posi- 
tive components are prepotent. The 
use of compound solutions makes it 
»ossible to study interaction of posi- 
tive and negative components within a 
mixture. Some mixtures show alge- 
braic summation of hedonic intensities, 
others reveal interaction effects. 

4. Measures of preferential taste 
thresholds vary with the method em- 
ployed in testing and with motivating 
conditions. There are different kinds 
of thresholds: sensory and preferential, 
physiological and behavioral, thresh- 
olds of reinforcement. . 

5, Current studies confirm earlier 
findings that the level of acceptability 
of a food varies with organic need 


states produced by depletion, surgical 
operation, and other bodily conditions. 
Studies of need-free (nondeprived ) 

ts show that the development of 
es through experience 


iv rior 
aversive behav 
(immediate or repeated) must be dis- 
tinguished from the checking of inges- 
o 


i internal organic conditions as 
pod as approached. On the posi- 
ces side there is evidence that ac- 
a behavior develops on the basis 
f hedonic effects and that the level of 
acceptability of a fluid is dependent 


upon the organic need state. 
6. The neurophysiology of pleasant- 

s and unpleasantness is largely un- 
sou unexplored. The total 
p mo suggests that sensory stimula- 
ud roduce primary evaluations, or 
echa 1s, in terms of “good” or 
E Eh un d s of innate neural 
ya 


and 


he basi 
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mechanisms. The genetic basis of pri- 
mary evaluation is found in untold 
eons of biological evolution. As a 
result of evolution there are built-in 
neural mechanisms that evaluate gus- 
tatory stimuli immediately in terms of 
acceptance-rejection and preference. 
The nature and functioning of these 
neural mechanisms can be investigated 
objectively and quantitatively. 

7. A distinction must be drawn be- 
tween the organizing function of af- 
fective processes and the persisting dis- 
positions which are organized. The 
organizing role of hedonic processes 
is most clearly apparent during the 
early stages of development. The 
acquired dispositions can function as 
independent organizers and regulators 
of behavior. In man, the acquired dis- 
positiona are called habits, attitudes, 

xpectancies, motives, etc. 

8. Some psychologists include the 
organizing-regulating functions within 
the concept of motivation, others, 
within the concept of learning. It is 
here suggested that hedonic organiza- 
i a regulation be studied for its 
ius AM also in relation to mo- 
Hee Me yen Further, it is 

experimental hedonism 
be extended from rats to other sub- 
jects, including the human child and 
adult, to other kinds of stimulati 
(especially tactile and thermal) wes 
to other forms of behavior such as sex 
play, curiosity, exploration, etc. j 
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] he Overlearning Reversal E 
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It is argued that the ORE is 
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nonovertrained animals 
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mple position problems. 
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but not 
A number of studies have investi- 
flect of overtraining on the 


a simple position habit. 
+. is found either that over- 
as no effect on reversal trials 
ion (Clayton, 1963b; D'Amato 
& Jagoda, 1962; D'Amato & Schiff, 
1964, Experiment 2; Hill, Spear, 
& Clayton, 962, Experiment ll; 
Komaki, 62; Mackintosh, 1963a, 
Experiment V, Group ]) or that it 
significant decrement in 
f reversal learning (Clayton, 
. Galanter & Bush, 1959; Hill 
. Hill, Spear, & Clayton, 
-xperiments 4» n re IV; 
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often found in brightness problems, 
and rarely found in simple Position 
learning. How are learning theorists 
to reconcile this dichotomy, and how 
are they to explain the ORE? 

First, let us consider why the ORE 
should be surprising. Reinforcement 
learning theorists have generally made 
the following assumptions: (a) Habit 
strength increases monotonically with 
the number of reinforcements; (b) the 
greater the habit strength, the greater 
the resistance to extinction; (c) re- 
versal learning is simply a matter of 
extinguishing an old habit and then 
learning the opposite one. From these 
three assumptions, theorists have 
properly deduced that the ORE should 
not occur. Since it does occur, one 
or more of the three assumptions must 
be unjustified. The goal of this paper 
is to consider which of these three 
should be changed and to propose a 
modification. 

To the writer's knowledge, the ORE 
has not prompted anyone to suggest 
that we abandon the first assumption. 
A number of proposed explanations 
would change the second assumption, 
and even more would modify the third. 
As for the second, there is some evi- 
dence that resistance to extinction is 
not a monotone function of the num- 
ber of rewards, even under 100% 
reinforcement (e.g., North & Stim- 
mel, 1960; Siegel & Wagner, 1963). 
But it is hard to see how the second 
assumption can be modified so that 
it predicts the ORE in brightness 
problems, but not in position learning. 
Moreover, there are data within the 
very experiments in which an ORE 
is found which suggest that resistance 
to extinction is greater for the over- 
trained subjects than for the non- 
overtrained ones—in accordance with 
the second assumption. The length of 
the initial string of errors in reversal 

seems a reasonable indicator of re- 


ErijAH Lovejoy 


sistance to extinction. And there is 
substantial evidence that this string 
is longer for overtrained than for non- 
overtrained subjects (Hill et al., 1962; 
Mackintosh, 1962). Furthermore, in 
the one test which has been made of 
trials to extinction with and without 
overtraining (Mackintosh, 1963c), it 
was found that the overtrained sub- 
jects took longer to extinguish. 

Thus it appears that in order to fit 
the ORE into learning theory we will 
have to change assumption (c). We 
cannot maintain that reversal learning 
is simply a matter of extinguishing an 
old habit and then learning the oppo- 
site one. 

Explanations for the ORE which 
would revise the third assumption gen- 
erally have assumed that for some 
reason there is positive transfer from 
original learning to reversal learning; 
that is, during overtraining the rats 
learn something which is useful to 
them in reversal. Such explanations 
include Reid’s “response of discrim- 
inating” (1953), Spence's “reduc- 
tion of position-response tendencies" 


(as quoted in Reid, 1953), Harlow's 
"reduction of 


error factors" (1949), 
and Sutherland’s "switching in of 
stimulus analysers” (1959). These 


last explanations vary in the degree 
to which they are Specific to the 
problem at hand. The most general 
proposal might be of the "learning 
set" type, and assume that all learning 
experience is helpful for all future 
problem solving. While this could 
clearly predict an ORE, it could not 
easily predict that an ORE will be 
found in a brightness situation and 
not in position problems. 

The theory that will be developed 
in this paper, although developed in- 
dependently, is quite close to Suther- 
land's idea of Switching in stimulus 
analyzers, and to the thinking of 
Zeaman and House (1963). Those 
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familiar with Wyckoff's (1952, 1954) 
papers will notice a formal similarity 
between his ideas and those presented 
here. 

It is argued that in order to solve a 
discrimination problem, animals must 
first learn to attend to the relevant 
dimension (e.g., brightness), and then 
they must learn to make the ap- 
propriate response once they have 
attended to that dimension. One 
rationale for such a theory follows: 
An animal in a two-choice apparatus 
like a T or Y maze makes a "'decision" 
at the choice point. He then pro- 
ceeds to the end of the alley and is 
either rewarded or nonrewarded. At 
the time of reward or nonreward, the 
rat must have stored information 
about which choice he made if he is 
ever to "learn" from the experience. 
(This information may be in memory, 
or it may be available externally in 
goal-box cues.) For if the rat in the 
goal box does not “know” whether he 
chose white or black, then he cannot 
associate either of these stimuli with 
reward or nonreward. Now surely 
a rat has limited information-storage 
capacity. On a given trial he might 
remember which side he chose, whether 
he approached or avoided some special 
odor, whether he repeated his pre- 
vious response, whether he went to 
black or white, whether he went to 
the side where he was most poa 
rewarded, etc. But he probably wi 
not remember all these various pos 
butes of the choice he mace. T. = 
development which follows 1S ae 
on the simple idea that a rat 
attend to only some of the possible 
dimensions on any given trial, an 
that in some sense learning can Ad 
only on those trials when the spem 
is attending to the relevant rà : 
sion. lt has been argued before m 
such a “selective Ao ' ZR » 
nism could explain the 


tosh, 1962; Sutherland, 1959). The 
argument is that overtrained animals 
will continue to attend to the relevant 
dimension throughout reversal and 
so will learn the reversal quickly. 
Nonovertrained animals, on the other 
hand, will abandon the relevant di- 
mension after reversal because it is 
consistently leading them to the 
"wrong" choice. Only after a period 
of responding to other dimensions will 
the animals rediscover the relevant 
dimension and learn the reversal. 
This seems a reasonable sort of ex- 
planation, but it is somewhat loose. 
In the sequel, two different mathe- 
matical models are presented which 
are based on this type of verbal 
“attention” theory. It will be seen 
that although they differ slightly in 
the assumptions made, both predict 
the ORE in some cases and not in 
others. For both these models, the 
correct choice in initial learning will 
be denoted by W and the incorrect 
choice, by B. This simplifies dis- 
cussion of reversal learning and elim- 
inates the necessity of phrases like 
“the new correct choice for reversal, 
which was previously the incorrect 
choice...” This notation should 
not be interpreted to mean that 
these models are concerned only with 
“brightness” problems. As will be 
seen, they are fairly general. 


MopEL I—A Two-STaGE MODEL 


Assumptions 

The first attention-motivated model 
which will be presented is a two-stage 
chained model. It assumes that on 
each trial an animal may or may not 
attend to the relevant stimulus di- 
mension, with the probability of at- 
tending on Trial » denoted by Pa(4). 
Jf he attends, he then has a certain 
conditional probability of making the 
correct choice, P.(W|A). If he fails 
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START 


Fic. 1. The state diagram for the first 
model. (On any trial a subject’s performance 
is represented by one of the four paths: AW, 
AB, AW, and AB.) 


to attend, we will assume that he 
has a success probability of 1. Thus 
four different events are possible : 


1. The animal may attend and 
choose white: Event A W (rewarded). 

2. The animal may attend and 
choose black: Event 4B (not re- 
warded). 

3. The animal may fail to attend 
and go to white: Event AV (re- 
warded), 

4. The animal may fail to attend 
and go to black: Event AB (not 
rewarded). 


This model is portrayed in the tree in 
Figure 1. 

We must now make some assump- 
tions about how the various events 
change the two probabilities P(A) and 
P(W|A). For this model we will 
assume that reward and nonreward 
affect the attention process just as 
they affect overt responses. That is, 
we will assume that when Event AIV 
occurs (the animal attends, chooses 
white, and is rewarded), then on the 
next trial he will be more likely to 
attend, and having attended he will 
be more likely to make the cor- 


rect choice. Both P(A) and P(IW|A) 


will be increased, each approaching 
a limiting value of unity. When 
Event AB occurs (the animal attends, 
chooses black, and is not rewarded), 
then, since attending was not re- 
warded, it is assumed that P(A) will 
decrease; but going to black after 
attending was not rewarded, so it is 
assumed that P(IV|A) will increasc. 
When the animal does not attend, it is 
assumed that no changes will be made 
in P(W|A). This is an important 
assumption in the present model and 
differs from the corresponding one 
made in the second model. For the 
present model, when Event AIV oc- 
curs, not attending is rewarded, and 
so P(A) will decrease. But P(W|A) 
will not change. Similarly, when 
Event AB occurs, not attending is 
nonrewarded, so P(A) should increase, 
while P(W|A) is unchanged. ‘These 
results are summarized in Table 1. 
We now have the basic framework 
for a model. We must still assume 
the exact amount of the trial-by-trial 
changes in the two probabilities. In 
hopes of mathematical simplicity, 
linear transformations are assumed. 
It should be mentioned that this is 
a most arbitrary assumption and is 
made strictly for mathematical con- 
venience. It is quite possible that 
some other probability transforma- 
tion rule, while still maintaining the 
limit points implied by Table t, could 
provide an even better description 
of the real world. Linear operators 


TABLE 1 


CHANGES IN P(.1) AND P(W|A) WHEN THE 
Pour DIFFERENT E TS OCCUR 


dm Change i Change in 
Event PG m PAPIA) 
AW (rewarded) e| iu T 
AB (nonrewarded) = F 
ATI (rewarded) = 0 
AB (nonrewarded) + 0 
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which will change the two probabili- 
ties in the directions indicated in 
‘Table 1 are written in Table 2. 

At this point Model I has been com- 
pletely specified. It would be nice 
to derive some properties of the model, 
such as the explicit formula for the 
learning curve, the expected 


mean 
This has 


total number of errors, etc. 
not been possible, despite consider- 
able effort on the part of the writer 
and others. However, we can get 
a about the properties of the 


some idee 
model by the "stat-rat" method, dis- 


i detail by Bush and Mosteller 

In brief, this method pro- 
onse protocols of 
d be found in a 


cussed ii 
(1955). 
duces sample resp 
the type that woul 
population of “statistical rats" which 
behaved according to the assumptions 
above. Before we look at some stat- 
rat computations, however, let us con- 
sider at an intuitive level the condi- 
tions which might produce an ORE 
with the present model. 

Consider an animal who starts out 
with an initial probability of attending 
to the relevant dimension, Py(A) 7-5. 


Suppose he has no response bias, so 
that P,(W|A)-.5, too. Now in 
the rat 


the problem, ! 
end on all trials and 


hite once he has 
back at Table i, 


order to solve 
must learn to att 
to always go to W 


attended. Looking 
we can see that P(W|A) cannot 
decrease. It can only increase O! 


The probability 
other hand, may 
depending on the 

Thus if similar 
he changes in 
vould expect 
r than P(A). 
at stat 


remain unchanged. 
of attending, on the 
increase or decrease, 
event which occurs. 
rate parameters govern t 
both probabilities, We Y 
P (W|A) to increase faste 
Indeed, when one looks 
rat curves, this is what happens: 
P(W|A) reaches asymptote well be- 
fore P(A) does. Because of this, 
animals who have just met a criterion 
of learning but have not been Over- 


TABLE 2 
LINEAR TRANSFORMATIONS OF THE PROB- 
ABILITIES P4(4) AND P,(1V|A) Waica 
ARE POSTULATED TO OCCUR AFTER 
THE VARIOUS EVENTS 


Event Psa(A) Prat(WIA) 
AW aPalA) 4-1 -ailas Ps (IF | 4) +1 —as 
AB asPSCA) JaaPa (WIA) 3-1 -as 
All ei Pa (et) P4UV]A) 
AB | aPa(A)+1—ax P,(W|A) 


Note.—a1, as, as, and 3 ers si 
oc zi as are rate parameters such that 


trained have P(IF|.4) about equal to 
1, and P(A) lower, perhaps about .9. 
Overtrained animals have both prob- 
abilities very close to 1. At the 
start of reversal, then, the nonover- 
trained subjects have a very low prob- 
ability of Event 4B—since P(B|A) 
-1—P(W|A) «0. The probability 
of Event AB, while small, is consider- 
ably greater. Thus these subjects 
are likely to get their first reversal 
reinforcement by Event AB. But 
this will reduce their probability of 
attending, since they have been re- 
warded for not attending, and Event 
AB will become even more likely. As 
this sort of process recurs, the non- 
overtrained subjects will be likely to 
stop attending to the relevant dimen- 
sion—and hence to stop learning the 
reversal. The overtrained subjects, 
on the other hand, have both P(A) 
and P(W|A) very close to 1 at re- 
versal. And if the two nonreward 
rate parameters, ae and ay, are ap- 

roximately equal, then P(A) and 
P(W|A) will decrease together during 
the string of initial reversal errors. 
For these subjects, P(AB) will be 
greater than P(AB) carly in reversal. 
Thus they will be more likely to get 
their first reversal rei nforcement while 
attending. This, in turn, will in- 
crease P(A), decrease P(W|A), and 
increase the likelihood that future re- 
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Fic. 2. Group reversal curves for two 


groups of stat rats, calculated according to 
Model I. (For original—prereversal—learn- 
ing it was assumed that P,(A) = 3X) 


wards will come by AB rather than 
AB. These subjects will be likely 
to keep P(A) high during reversal. 
So they can decrease P(W|A) by 
means of reversal operators analogous 
to those in Table 2 and learn the re- 
versal comparatively quickly. 

The essence of this argument is that 
overtrained animals will continue to 
attend throughout reversal, while non- 
overtrained ones will be likely to stop 
attending to the relevant dimension 
for a considerable length of time 


during reversal. This will produce 
the ORE. 


Predictions 


In fact, if stat rats are run with con- 
ditions like those above, an ORE is 
found. In Figure 2 are plotted the 
reversal learning curves for two groups 
of stat rats. These Statistical ani- 
mals started original learning with 
Pi(A)=.5 and P,(W|A)=.5. It 
was assumed that the same pair of 
rate parameters governed the changes 
for both the probabilities, so that 
a,=a3;=.93 (the reward parameter) 
and az=a4=.99 (the nonreward para- 
meter; see Table 2). A high value of 
a in the operators shown in Table 2 
corresponds to a slow change in prob- 
ability. Thus nonreward was here 
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assumed to be a less powerful event 
than reward. ‘This asymmetry is 
desirable because of the data found in 
such experiments. Just after rever- 
sal, overtrained animals may make 
as many as 40 errors before the first 
reversal success (the average found by: 
Mackintosh, 1962). This implies a 
very weak effect of nonreward. But 
learning can take place fairly quickly 
in the black-white problem; so the 
reward operator must be more power- 
ful, as was assumed here. 

Animals in the nonovertrained 
group in Figure 2 were trained in 
original learning until they had 15 
consecutive trials without error. At 
this point they were reversed, and 
the group reversal learning curve is 
plotted. The overtrained animals 
were trained for 150 trials past this 
criterion (i.e., they were trained well 
past asymptote) and were then re- 
versed. Their group learning curve 
is also shown. Plainly, the over- 
trained group starts with a lower 
initial probability of correct, but after 
a small amount of reversal training 
the curves cross, and the overtrained 
group stays superior from that point 
on. The present model has predicted 
an ORE. 

Experimenters reporting the ORE 
have not presented learning curves, 
however. They have simply reported 
the trials to reach criterion in re- 
versal for overtrained and nonover- 
trained animals. As D'Amato and 
Schiff (1965), have pointed out a cross- 
over in learning curves may occur so 
late in reversal that no significant 
difference is found in reversal trials 
to criterion. For the stat rats re- 
ported above, however, the crossover 
occurs early, and there is a significant 
ORE in terms of trials to criterion. 
The nonovertrained subjects required 
108.3 trials to reverse and the over- 
trained, 86.1, on the average. These 


bs — 
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figures may be compared with acquisi- 
tion where stat rats required 77.2 
trials to reach the same criterion. 
These are the means for groups of 60 
animals, and a ¢ test applied to the 
difference indicates significance at the 
.005 level. 

‘Those familiar with previous experi- 
ments on the ORE may wish to know 
more than the means and the group 
learning curves reported above. Do 
overtrained animals uniformly reverse 
more quickly than nonovertrained 
subjects, or is the difference merely 
one of averages? This question is 
answered in Figure 3. Figure 3 
shows the empirical frequency distri- 
bution of trials required to complete 
reversal criterion for the two groups 
of stat rats. lt may be seen that the 
overtrained subjects tend to learn the 
reversal more quickly, but it is clear 
that there is substantial overlap be- 
tween the two distributions. The 
groups are different, but the reversal 
score for a sample subject would not 
suffice for identifying his previous 
treatment. 

Part of our goal 
plished. We have 


has been accom- 
shown that a sim- 
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groups of stat rats calculated according to 
Model I. (These groups are comparable to 


those in Figure 2 in every detail, except that 


for these animals it was assumed that Pi(4) 
— 10. That is, the stimuli were assumed 


very obvious in original learning.) 


assumptions, motivated 
by the idea that attention is important 
in discrimination learning, have pro- 
duced a model in which the ORE 
can occur. But there remains an im- 
portant step to be taken. Recall that 
the ORE is generally not found in 
position problems. We must now 
consider the conditions under which 


the present model predicts no ORE. 
In discussing the intuitive reasons 


why the present model should predict 
an ORE, we pointed out that for 
ained animals P(W|A) 


the nonovertr j 
reached asymptote before P(A), since 
and P(W|A) in- 


both started low i 
creased faster. But the situation is 
different when the initial probability 


of attending, P(A), is high. This 
might well be the case for a posi- 
tion problem. The rat is a nocturnal 
animal, and it seems reasonable to 
assume that position is a very obvious 
cue for such a beast. If we assume 
that it is completely obvious, that is, 
that P1(4) 71 for the rat ina position 
problem, then P(A) starts out at 
asymptote. So unless it were to 
drop sharply—and there is no reason 
to expect that it would—then no 


ple set of 
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"first reversal reinforcement" argu- 
ment like the one for brightness 
problems can be made. We have no 
reason to expect the ORE. 

In Figure 4 are shown reversal learn- 
ing curves for two groups of stat rats. 
The parameters for Figure 4 are iden- 
tical to those used for generating the 
curves in Figure 2, except that for 
these rats it was assumed that P(A) 
=1.0: The stimuli were taken to be 
completely obvious, as they might be 
in a position problem. As before, the 
nonovertrained subjects were reversed 
after meeting a criterion of 15 consecu- 

tive successes, and the overtrained 
group was trained for 150 trials past 
this point and reversed. There is 
no ORE in Figure 4. The over. 
trained group starts reversal inferior 
to the nonovertrained one and holds 
this inferiority throughout reversal. 
Judging from the group learning 
Curves, one would expect that over- 
training might have produced a signi- 
ficant decrement in reversal speed for 
this obvious problem. It did. The 
nonovertrained group met the reversal 
criterion in 81.1 trials, as contrasted 
with 96.6 trials for the overtrained 
group. (A £ test shows this difference 
to be significant, p <.02.) For this ob- 
vious problem, acquisition was much 


One more stat 
data is of interest: 


le first reversal 


success. For both problems the over- 


trained subjects made more such 
errors. For the animals with Pi(A) 
—.5, the means for overtrained and 
nonovertrained groups were 10.13 and 
5.83, respectively. For the problem 
with P;(A)=1.0, these means were 
11.12 and 6.0. Again, the stat-rat 
results closely parallel those found in 
experiments. 


Discussion 


The model which has just been de- 
scribed will, for reasonable parameter 
values, predict an ORE for stimuli 
which are fairly obvious but not com- 
pletely obvious. The initial prob- 
ability of attending, P(A), can be 
varied from about .3 to about .7 with- 
out removing the ORE. For P(A) 
<.3, learning is slow, and a great many 
stat rats "refuse to learn." While it 
has been proven that such subjects 
would eventually learn, many hun- 
dreds of trials might be required be- 
fore they did. With Pi(A)— 4, about 
1 subject in 25 “fails to learn" in a 
reasonable length of time. As P,(A) 
increases past about .8, the difference 
between overtrained and nonover- 
trained groups disappears until with 
Pi(A)-1.0 overtraining hurts re- 
versal (see Figure 4). This gives us 
a clue about conditions when the ORE 
might not be found in 
Situation: in apparatus where the 
stimuli are very obvious, so that the 
rat must almost alwa ys "notice" them, 
and P(A) is very high, D'Amato 
and his group have repeatedly failed 
to find the ORE in his automatic Y 
mazes (D'Amato & Schiff, 1965). In 
this situation the whole positive arm 
is diffusely illuminated by the light 
cue. And the whole negative arm is 
1,000 times darker. Thus the rats are 
literally “bathed” jn the stimuli, 
Perhaps this discrimination is just too 
obvious for an ORE to appear. 

There are other results 
D'Amato's laboratory Which indi- 
rectly confirm this interpretation. 
We have argued about the importance 
of the initial probability of attending 
to the relevant stimuli. In most T 
or Y mazes, a “position” problem in- 
volves motor cues (the actual turning 
response), abstract direction cues 
(going left or right), and extramaze 


the brightness 


from 


^ 


OVERLEARNING REVERSAL EFFECT 95 


cues (going to the place on the left, 
or the place on the right). This last 
class of cues (which would be most 
useful in forming a “cognitive map") 
is deliberately omitted from the auto- 
matic Y maze used in D'Amato's ex- 
periments. This might mean that 
“position” is a less dominant dimen- 
sion in this apparatus than in most 
others (and hence that other cues, 
like brightness, are more available). 
Perhaps this is why in five pure posi- 
tion experiments, D'Amato's group 
has never found any significant dec- 
rement in reversal speed due to 
overtraining (1962, 1964). Clayton 
(1963a), Galanter and Bush (1959), 
Kendler and Kimm (1964), and Hill 
and Spear (1963) have found such a 
decrement. (So did Hill, Spear, & 
Clayton, 1962, in three separate ex- 
periments. But they reversed their 
animals for a short, constant number 
of trials, rather than to a criterion. 
‘Thus their results may have been in- 
fluenced by the long string of initial 
errors which is generally made by the 
overtrained group in any overtrain- 
ing reversal study.) In one experi- 
ment in the automatic Y maze, bright- 
as consistently correlated with 
aking an especially obvious 
In this case, and in this 
overtraining led to a 
at in reversal trials 
to & Schiff, 1964). 
tes the argu- 
D'Amato's 
tcue than 


ness W 
position, m 
problem. 
case alone, 
significant decremet 
to criterion (D’Ama 
Perhaps this substantia 
ment that position, in 
apparatus, is a less dominan 
in more standard apparatus. 
Erlebacher also did a very careful 
brightness study and failed to find 
an ORE. If one looks at the vari- 
ous apparatus which have been used 
in brightness studies, one finds that 
Erlebacher's was the only enclosed 
T maze. lt had stimuli painted on 
the sides, the ends, and the floors ot 


the arms of the T. Thus in Erle- 


bacher's study, too, the rat was sur- 
rounded by the relevant stimuli. 
Perhaps this problem, like D'Amato's, 
was too obvious. A great majority 
of the studies in which an ORE has 
been found have used stimuli painted 
on doors which the rat knocked down 
or opened in making his choice. Only 
one other experiment has used a T 
maze (Capaldi & Stevenson, 1957), 
and that was an elevated T, 13 inches 
wide. The rat presumably had other 
things to occupy his attention besides 
the color of the paint on the floor. 
But if Erlebacher’s and D'Amato's 
apparatus were such that P1(4) was 
high for brightness, then this fact 
should be reflected in other ways be- 
sides their failure to find an ORE. lt 
is. We have argued that, in general, 
position is obvious and brightness is 
obscure. This difference affects the 
learning rates found in those two types 
of problems. Position is generally 
learned much more quickly than 
brightness. Hill and Spear, for ex- 
ample, found rats took about 17 trials 
to arrive at criterion in a T-maze posi- 
tion problem with a small food re- 
ward. (Contrast this with the 51 
trials required for position learning 
in D'Amato's automatic Y maze— 
D'Amato & Jagoda, 1962, Experi- 
ment IV). Brightness, on the other 
hand, is usually learned to the same 
criterion in between 60 and 80 trials 
(Mackintosh, 1962; Pubols, 1956; 
Reid, 1953). D’Amato’s rats fall into 
this range (D'Amato & Schiff, 1965), 
but Erlebacher's subjects reached cri- 
terion in 27.8 trials. Thus it appears 
that Erlebacher's animals learned 
brightness unusually quickly (indicat- 
ing an easy, Or obvious problem). 
D'Amato's animals learned position 
very slowly. This may indicate that 
position is an obscure dimension, and 
hence that brightness is relatively 
stronger; alternatively it may mean 
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ing is generally slow in that 
that leaming Se i bibe 
dead "Tun fast," and must have 
been obvious. - 

This post hoc theorizing does more 

than sweep the offending data under 
the rug: It also gives us a handle on 
the thorny problem of estimating 
P(A) for a given situation. The ob- 
viousness of a problem should be re- 
flected in its difficulty as compared 
with a standard position problem. If 
a problem is three or four times as 
difficult as a position problem, then 
one might well expect an ORE to 
appear. If it is only about twice as 
difficult, one could expect the ORE 
to be elusive. Experimentation across 
problems in a standard piece of appa- 
ratus is needed here. 

In order to “wrap up” the existing 
data on the ORE, we assumed that 
position was a very obvious cue for 
a rat, so that in a position problem 
Pi(A)=1. We must now face the 
fact that Pubols (1956) found an ORE 
in a position problem as did Capaldi 
in 1963. 

Both these studies were small; the 
first involved eight and the second, 
five rats in each group. Pubol's result 
Seems to have been a statistical acci- 
dent. His overtrained group learned 
the reversal more quickly than the 
nonovertrained group; but if one in- 
spects his data, one discovers that the 
overtrained group also learned the 
initial problem about twice as quickly. 
One suspects that Pubols had smart 
rats in his overtrained group. Hill 
and Spear (1963) replicated the con. 
ditions of Pubol's study in detail and 
found no ORE, but rather a significant 

decrement in reversal trials to criterion 
because of overtraining. 

Capaldi's study was smaller but is 


not so easy to explain away. Suther- - 


land has pointed out? that on an ele- 
vated T maze like Capaldi’s, rats are 
afraid, and rats under a strong fear 
drive will tend to fixate on one position 
in a T. Perhaps Capaldi’s nonover- 
trained group, because it had had less 
experience on the maze, was more 
afraid and hence more fixated by fear. 
It should also be mentioned that 
Capaldi did not train his rats to a 
previously set performance criterion 
but trained all nonovertrained sub- 
jects for a constant 16 trials. 'Thus 
although Capaldi's study does not fit 
in with the other data considered 
above, the discrepancy may be due to 
somewhat special factors, Further 
research is needed to resolve this 
problem. 

With minor exceptions, then, the 
present model seems to tolerate all 


existing data on the overlearning re- 
versal effect. 


Mover II—An ANALYZABLE MODEL 


The model which has just been de- 
Scribed seems promising. But it is 


mathematica] anal- 


» a search has been 
made for a model wh; 


Model I in its assumptions but which 
is sufficiently different so that some 
analysis is possible. One Such model 
has been found, and a complete report 
of the analysis appears elsewhere 
(Lovejoy, 1965). It was found that 
by making a seemingly innocent 
change in the assumptions of Model I 
it was possible to produce a model 
with many properties which can be 
determined analytically. 

Model I stated that two interacting 
learning processes combine in dis- 
crimination learning: learning to at- 
tend, and learning to respond appro- 


?N. S, Sutherland, 


T personal communica- 
tion, August 1964. 


cat 


à 
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TABLE 3 


TRANSE A d P 
ANSFORMATIONS APPLIED AFTER EACH EVENT AND LIMIT VECTORS FOR 
THESE TRANSFORMATIONS 


Probabi 
“cvent P(AW) P(AB) P(A) 
AW o 
Dont | TAM Ea aPa(4B) aPa(A) 
AB 25 i 0 
IMs aP (AW) + (1 2908 aPa (4B) PDA 
xw M 0 1—N 
Payis Pal AW; 
p aPa (AW l AF AB) aP,(A)31 n 
AB 
n afili HU —a)i aP, (AB)+(1—a)} aPa D 
PE 4 


attending. Linear oper- 
ators were introduced to describe 
probability changes at these two levels 
of learning. Model II is in a sense a 
single-level model: It assumes that 
on a trial an animal makes one of 
three responses. He attends and 
chooses white (Response AW), he at- 
tends and chooses black (Response 
AB), or he fails to attend (Response 
A). The attending response is totally 
bound to the two subsequent choices. 
In this simpler three-response model, 
any change in one probability will 
normally change both the others since 
the three must together sum to unity. 

The state diagram for Model II is 
the same as that for Model I and is 
given in Figure 1. The same four 
events are possible: AW (rewarded), 
AB (nonrewarded ,AW (rewarded), 
and AB (nonrewarded). The state of 
an animal in tua- 
tion can be 


priately after 


P,,(A)], where the three c 
the vector m 1 


of the four 
linear transformat 


vector is assumed. . j 
tions and their limit values are given 


in Table 3. A full description of the 


] events, 4 


justifications for these transformations 
is given in a separate paper (Lovejoy, 
1965), but the basic ideas are: (a) 
When a response occurs and is re- 
warded, it becomes more probable, 
and the probabilities of both the other 
responses are reduced by a factor a 
(0<a<1); (b) when a response oc- 
t rewarded, its prob- 


curs and is no 
esame factor, 


ability is decreased by th 
a, and the probability so lost is divided 
up among the other two responses in 
such a way that they approach con- 
stant limit points. Note, for example, 
that when Event AB occurs (the 
animal fails to attend and is not re- 
warded), the probability of nonat- 
tending is reduced by the factor a, and 
the probabilities of the two other re- 


each approach a limit value 


sponses c 
of à. This value 1s the result of 


a symmetry assumption—since the 
subject was not attending to the 
relevant dimension, he cannot “know” 
whether he went to black or white, so 
he should treat the two colors sym- 
metrically. When Event AB occurs, 
on the other hand, no such symmetry 
argument is applicable; accordingly 
the limit point must be left as a new 
free parameter in the model, NA 
More will be said below about the 
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lated according to Model II (Theorem 1) for 
two groups of animals with P,(A) = 0. 


mathematical and psychological im- 
portance of these assumptions and 
how they distinguish Model II from 
Model I. For the moment it is of 
interest to note that this model, too, 
can predict the ORE. Because the 
single-stage Model II is simpler mathe- 
matically, it has been possible to de- 
rive an explicit expression for the ex- 
pected mean learning curve for this 
model. This is done in Theorem 1 of 
the companion paper. Using this 
theorem, it is a simple matter to in- 
vestigate learning and reversal with 
various values of Pi(A), the initial 
probability of attending. Since "at- 
tending" is not a unitary response in 
Model II, we must add P\(AW) and 
P,(AB) to find the analogue of P;(A) 
in Modell. Figure 5 is the Model I1 
version of Figure 2 for Model I; the 
curves of Figure 5 are smooth be- 
cause for this second model the exact 
form of the expected mean learning 
curve can be predicted, and so there 
is no need to resort to stat rats to 
explore the model. Figure 5 shows 
the course of reversal learning for two 
groups of Model II animals. The 
nonovertrained group was reversed 
after 20 trials of acquistion, the over- 
trained group after 300. These curves 
are for reversal of a very difficult 
problem, in that P,(A) was taken to 


Lovejoy 


be 0. 


The rate parameter, o, was 
taken 


to be .93 for these computa- 


tions, and the limit point, \,, was 
taken to be 1. Under these condi- 
tions, 


the overtrained group clearly 
overtakes the nonovertrained group 
early in reversal. Like Model 1, 
Model II predicts an ORE if P(A) is 
low. 

Figure 6 shows similar curves, cal- 
culated with the same learning param- 
eters as Figure 5, but here the problem 
was very obvious: Pi(A) was taken 
to be 1. The nonovertrained group 
was trained for 10 trials before re- 
versal (learning is fast in this obvious 
problem); the overtrained group was 
reversed after 300 trials. For these 
subjects, overtraining has slowed the 
whole course of reversal learning. 
Again, Model I! parallels the predic- 
tion of Model I. 

The amounts of training given and 
the values chosen for P\(A) and X, 
are rather extreme for Figures 5 and 
6. These values have been chosen 
to accentuate the fact that this 
simple, single-process model can on ap- 
propriate occasions predict the ORE, 
and that at least qualitatively this 
model resembles the data which have 
been found. These two figures are 
representative of the types of curves 
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Reversal learning curves, calcu- 
lated according to Model II (Theorem 1) for 
two groups of animals with the same param- 


eters as those in Figure 5 except that here 
PA(A) = 19. en 
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‘which are very generally found for a 
„high value of Ai: When P(A) is high, 


the learning curves do not cross, but 
when P,(A) is low, they invariably 
do. Actual experimental conditions 
would probably correspond to less 
separated curves than those in Figures 
5 and 6, but the same general shapes 
could be expected. Because of the 
tractability of Model 11, it has been 
possible to prove the prediction of an 
ORE not only by means of curves like 
those shown in Figures 5 and 6, but 
also by investigating the relationship 
between the amount of prereversal 
training and the number of errors in 
reversal learning. ‘These results, and 
also the predictions of Model II for 
extinction conditions, are derived and 
discussed in the companion paper. 
Since Model ll can be handled 
mathematically, the reader may well 
wonder why so much time has been 
to a discussion of the less ele- 


given D _ the 
This point 1s central 


gant Model I. 
in the following section. 
A COMPARISON OF qu Two MODELS 
Two different models have been pre- 
for discrimination learning, 
each of which is capable of predicting 
an ORE with some parameter values. 
For the first modelit was assumed that 
"attending to the relevant dimension" 
was a response, which was strength- 


ened by reward and weakened by non- 


reward. "Choosing white after at- 
tending" was also assumed to bea 


response and was assumed to change 
in a similar fashion. This model has 
considerable intuitive appeal, but it 

ble to analyze mathe- 


has been impossi 
matically. The second model as- 


sumed that the subject made only one 
response on any trial, which might be 
"attend and choose white," "attend 
and choose black," or “do not attend 
to the relevant stimuli." This change 
in definition of the response (or of the 


sented 


pun utendo, 
pe e Mus 
this difference aff P d ur ROW aars 
ed iain t ees mathematical 
i E gical properties of the 
A crucial assumption in Model | is 
aig ii the subject fails to attend, 
change occurs i TE : 
chance of essere lo |4), s 
s s 'e on attending 
trials. This means that if for some 
reason a subject stops attending to the 
relevant stimulus for a number of 
trials, P(W|A) will remain at a con- 
stant value, even though the animal 
may be responding in the experi- 
mental situation. P(W|A) is dis- 
sociated from the learning process 
when the subject is nonattending. 
This dissociation is reminiscent of 
Krechevsky's notion of “hypothesis 
behavior,” and has strong experimen- 
tal implications which will be dis- 
cussed below. It also makes math- 
ematical analysis of Model 1 very 
difficult. 
Model ll, on the other hand, is 
closer to the “continuity” view of 
learning. All trials affect all prob- 
abilities. The crucial difference be- 
tween the two models is the effect of 
Event AB: nonrewarded nonatten- 
tion. In Model I this event makes 
the subject more likely to attend, since 
nonattending was nonrewarded, but 
it does change the relative strengths 


of the two attending responses. For 


Model IJ, too, Event AB makes at- 
But at the same 


tending more likely. 

time it moves both P(AB) and 

P(.AW) towards their limit value of $ 

(see Table 3). Consequently PQV|A) 

moves towards a limit value of 3, too. 

Thus, during a long string of trials 
when an animal is not attending to 
the relevant stimuli, Model JI pre- 
dicts that an animal "forgets" what 
he had previously learned about them. 


Model | predicts that the rat has 
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“memory” in the form of an un- 
changed P(W |). " 
There is some evidence that a dis- 
sociation" assumption like that in 
Model | may be necessary. Goodwin 
and Lawrence (1955), for example, 
ran rats in a T maze with two stimulus 
dimensions besides position (which 
was varied randomly). On each trial 
one arm was white and the other 
black; at the same time one arm had 


high hurdles, and the other low 
hurdles. These two cues were uncor- 
related. In Stage I, animals learned 


a brightness discrimination. In Stage 
II, they learned to discriminate high 
from low hurdles but with brightness 
cues still present. In the third stage, 
they either relearned the Stage I 
brightness problem, or they learned 
the reversal. It was found that re- 
learning was much faster than rever- 
sal, and that overtraining on Stage II 
did not change this result. It appears 
that the probability of going to white 
given that the subject was attending 
to brightness, P(W|A), stayed high 
throughout the second stage, despite 
the fact that brightness cues were 
present and randomly rewarded during 
Stage II. Model I, with its dissocia- 
tion assumption, could easily be ex- 
tended to encompass such a result. 
It is not easy to see how to make a 
similar extension of the second model. 
Another datum which adds to the 
plausibility of the dissociation as- 
sumption was first discovered by 
D'Amato and Jagoda (1960) and has 
since been confirmed by Mackintosh 
(1963c). These experiments, too, had 
threestages. In the first, rats learned 
a brightness problem to criterion. 
Then they were extinguished (neither 
response rewarded) either for a fixed 
number of trials (D'Amato & Jagoda) 
or to a criterion (Mackintosh). After 
this, they learned the reversal prob- 
lem. Both experimenters found that 
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just after reversal learning started, 
there was an abrupt drop in the pro- 
portion of correct responses (in the 
reversal). It appeared that these sub- 
jects had stopped heeding brightness 
in extinction, and that when they 
started to attend to brightness in re- 
versal, they still had the habit of 
choosing the formerly correct alter- 
native. It seems, again, that P(W|A) 
had been dissociated during the ex- 
tinction period. 

A third reason for liking this dis- 
sociation assumption is that Model I 
seems to be more nearly a correct rep- 
resentation of the real world than 
Model II. The parameter values for 
the Model II curves shown were care- 
fully chosen. Although the model 
often predicts an ORE, only in some 
rather special cases does it predict 
one which is as large as the experi- 
mentally obtained ORE. For Model 
I, on the other hand, it is easy to get 
an ORE. And the curves shown are 
honestly representative of the be- 
havior of the model over a 


large par’ 
of the parameter space, hae 


Some FINAL Discussion 


In both Model I and Model II, the 
initial probability of attending, P,(A), 
corresponds to the "obviousness" or 
“distinctiveness” of the stimuli. This 
variable proved to be crucial in de- 
termining whether or not an ORE was 
predicted. And it also has the de- 
sired effect on speed of learning: The 
higher the initial probability af at- 
tending, the faster learning t 
Indeed, one of the attractive feat 
of the present mod 
in the stimulus si 
speed of learnin 


irs. 
i ures 
els is that changes 
pne change the 
E by changin - 
ceptual parameter, P, (4), ane dun 
by changing a learning-rate parameter 
—which might better depend on such 
variables as motivation, or the species 


itt 


AA 


- other half, both were white. 
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of] the subject. ¿Results like those,of 
Gibson and Walk (1956) can similarly 
be explained by assuming that “per- 
ceptual learning" affects Pi(A). No 
change in rate parameter need be 
assumed. 


Attending and Observing—Not the Same 


A number of previous quantitative 
models for discrimination learning 
have been concerned with some sort 
of perceptual mechanism like the 
"attending" process in the present 
models. Wyckoff (1952) and Atkin- 
son (1960) postulated overt observing 
responses which were affected by re- 
ward or nonreward much as was at- 
tending in this paper. Their formu- 
lations avoided unnecessary “mental- 
ism,” but there is now evidence that a 
more central process is involved. 

Mackintosh (1962, 1964) has done 
a number of experiments which indi- 
cate that an allending response, rather 
than an overt observing response, is 
needed to account for the ORE. One 
af these was especially forceful. In 

964, Mackintosh did a two-stage 
study. In the first stage, animals 
learned to distinguish a square from a 
diamond. The square was the posi- 
tive stimulus, so it is clear that the 
rats learned all necessary overt re- 
sponses for observing it. On half the 
trials both stimuli were black; on the 
This 
change was irrelevant to the solution 
of the first problem, but ensured that 
the subjects were orienting towards 
the black and white squares. Half of 
the Stage 1 subjects were overtrained, 
and so, presumably, had learned the 
"observing" response especially well. 
Stage I] was a brightness discrimina- 
tion in which the stimuli were the 
same black and white squares which 
had appeared in Stage I. So if any 
overt observing OF orienting response 
is crucial for the ORE, the overtrained 


subjects should have learned the new 
problem faster. In fact, they learned 
it more slowly. Mackintosh inter- 
prets this result as meaning that over- 
training made the animal more likely 
to attend to spatial orientation—the 
relevant dimension for Stage I—and 
hence less likely to attend to bright- 
ness—the relevant dimension for Stage 
ll. This interpretation agrees nicely 
with the present models. 


Some Unsolved Problems 


Before we can completely under- 
stand a rat’s behavior in even the sim- 
ple black-white discrimination learn- 
ing situation, it appears that we must 
find satisfactory answers to a number 
of questions, some of which have been 
touched on above: How many dimen- 
sions can a rat attend to at any one 
time, or on any one trial? Both 
models have cunningly avoided this 
question by concentrating on atten- 
tion or nonattention to a single rele- 
vant dimension and not concerning 
themselves with other dimensions. 
While this ploy has been moderately 
successful, one would hope eventually 
to expand his understanding. It has 
long been known that an animal can 
be responding consistently to one 
position and still be learning some- 
thing about brightness (Spence, 1945). 
Indeed, while responding to position 
his latencies to black and white cards 
may indicate that he distinguishes 
them (Mackintosh, 19632; Mahut, 

1954; Maier, Glaser, & Kl 1940; 
Ross, 1962). But if an animal thus 
appears to be attending to more than 
one dimension, is he learning about 
more than one? And how many di- 
mensions interact in controlling be- 
havior on any given trial? Perhaps 


only a single dimension is effective. 
In this case, how does an animal shift 
dimensions? 


If an animal is respond- 
ing consistently along one dimension, 
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does that necessarily mean ae 
is learning about that dimension d 
more than one dimension is relevant, 
how do response strengths combine e 
shape behavior? What happens : 
two dimensions are opposed to cac à 
other? What if a dimension is only 
partially relevant Gea it helps aid 
subject to do better than chance, but 
does not guide him reliably to re- 
ward)? Do animals always learn, or 
do they sometimes adopt a wait-and- 
see approach which may result in no 
change in state over a large block of 
trials? . 
'These are all important questions, 
and we are certainly a long way away 
from solving most of them. It seems 
to the writer, however, that some sort 
of mathematical formulation will be 
most useful, and probably necessary, 
if an attack is to be made on many of 
them. As our questions get more 
subtle and we start to consider an 
increasing number of possibly rele- 
vant dimensions, Model |] may be 
expected to come into its own. For 
Model I, attractive though it is, can 
only be appraised through stat-rat 
simulation. These techniques are pos- 
sible, although not very powerful, in 
the simple case treated above. As 
more complicated situations are con- 
sidered, and more parameters are 
introduced, mathematical tractability 
becomes more than a luxury; it be- 
comes a necessity. It will always be 
possible to run a few groups of stat 
rats to demonstrate the plausibility 
of a given approach. But as soon as a 
large number of parameters is in- 
volved, mapping out the parameter 
space with stat-rat computations be- 
comes prohibitively expensive. It 
seems certain that analytic techniques 
must befound. And by extending and 
exploring models like Model II, one 
dares hope they will. 
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THEORETICAL NOTES 


PIAGET ON PLAY: 
A CRITIQUE 


BRIAN SUTTON-SMITH 1! 
Bowling Green State University 


It is contended that Piaget's attempt to reduce play to a function of 
thought is vitiated by his concern with a restricted class of intellectual 
operations and by his copyist epistemology. By resting the origin of 
representation on a copyist notion of imitation and by permitting 
play only the function of replicating concepts, Piaget deprives play 
of any genuinely constitutive role within thought. The contradictory 
consequences both in terms of the evidence about play's ontogenesis 
and Piaget’s ultimate resort to more characteristic affective and 


conative explanations are discussed. 


Piaget's (1951) Play, Dreams and Imi- 
tation in Childhood is the most recent of 
many attempts to make sense out of the 
theoretically anomalous subject matter of 
play. With the possible exception of psy- 
choanalytic play theory, it is the most con- 
ceptually elaborate account of play yet to 
be presented and, in addition, includes 
the best available examples of the se- 
quence of play activities in the first ycars 
of life. Tt is the intention of this paper to 
argue that Piaget’s thesis, to the effect 
that play may be interpreted functionally 
as an activity subordinated to adaptive in- 
telligence, leads to contradictions within 
his own system. The present analysis 
will maintain that Piaget’s epistemologi- 
cal assumptions and his concern with 
directed thought lie at the base of his 
misconstrual of play. 


Pracet’s EPISTEMOLOGICAL 
ASSUMPTIONS 


Piaget's treatment of play is insepa- 
rable from his theorizing on intelligence. 


1 This paper owes much to the stimulation 
of a visiting associate professorship in the 
Psychology Department and the Heinz 
Werner Institute of Developmental Psychol- 
ogy at Clark University, 1963-64. I am 
particularly indebted to Joseph Glick of Yale 
University and Bernard Kaplan of Clark 
University for their comments and sug- 
gestions. 


His root metaphor for the origins of in- 
telligence in human beings is the process 
of biological adaptation in lower or- 
ganisms. He is said to have been im- 
pressed in his earlier studies of mollusks 
by the way in which these lower organ- 
isms, while accommodating to the en- 
vironment, also actively assimilated it in 
accord with their own schemas of action. 


An image of an active organism which both 
selects and incorporates stimuli in a manner 
determined by its structure, while at the same 
time adapting its structure to the stimuli, 
emerged from these early studies as a ready 
made model for cognitive development [Fla- 
vell, 1963, p. 36]. 


The concepts of assimilation and accom- 
modation which were said to be applicable 
to lower organisms were also claimed to 
have applicability to the "intelligent" ac- 
tivities of higher organisms. These two 
became the indissociable functional in- 
variants in Piaget's conceptual system for 
the description of intelligence: 


Accommodation of mental structure to reality 
implies the existence of assimilatory sche- 
mata apart from which any structures would 
be impossible. Inversely, the formation of 
schemata through assimilation entails the 
utilization of external realities to which the 
former must accommodate .. . [1954, pp. 
352-353]. 
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Having made assimilation and accommo- 
dation ultimate categories of adaptive in- 
telligence, Piaget sought to express other 
(related) functions in these same terms. 
Imitation and play came to be formulated 
as particular types of relationships be- 
tween accommodation and assimilation. 
When accommodation to external reality 
dominated over assimilation, there ex- 
isted the state which Piaget termed imi- 
tation, Alternatively, if assimilation of 
external reality to preexisting concepts 
occurred, then there was the state to 
which the term play was given. Imita- 
tion and play were said to be polarized 
examples of the activity of intelligence: 
“Intelligent adaptation, imitation and play 
are thus three possibilities and they re- 
sult according as there is a stable equi- 
librium between assimilation and accom- 
modation or primacy of one of these two 
tendencies over the other [1951, p. 86]." 
From this description it follows that imi- 
tation and play should be explicable in 
terms of the same processes that determine 
adaptive oF intellective functioning. 
Leaving aside whether Piaget has shown 
that imitation is explicable in terms of his 
categories of adaptive functioning, it will 
be argued that his attempt to treat play 
within these categories is inadequate. 


Tre ASYMMETRY OF IMITATION 
AND PLAY 
vertones in Pia- 


There are Hegelian o 
n and play are 


get’s view that imitatio 
dialectical opposites which are gradually 
reconciled in a higher synthesis as the 
adaptive intellectual structures of logic 
progressively succeed them. But closer 
examination of the res ective roles of 
these two functions within his system 
suggests that imitation and play are not 
related as equipotent thesis and anti- 
thesis. Piaget verges, it is true, on the 
suggestion fhat in the origin of repre- 
sentative thought, imitation supplies 
M iay supplies Me a 

is antipho r al- 
a hrs biguously [1951, p. 3; 1960, 
the greater part, only 
ven a key role in the origin 
of concepts. Tmitation is said to be a 


source of representation, but play is used 
largely as an illustration of what repre- 
sentation is like. 

Piaget has criticized some of his prede- 
cessors, among other things, for their 
copy theory of knowledge or their naive 
realism. He has insisted that repre- 
sentative thinking is not merely a copy 
of external reality, but is rather the out- 
come of a long process of interaction be- 
tween the organism and the environment 
with each modifying the other over time. 
As indicated above, he has stated this 
interaction in terms of the concepts of 
assimilation and accommodation. It is 
very clear from the corpus of his writings 
that he does not favor a copyist episte- 
mology. Paradoxically enough, however, 
when the relationships between imitation 
and play are examined closely, the view 
emerges that Piaget has implicitly im- 
ported just such a copy theory into his 
own explanation of the processes of imi- 
tation, and that it is this implicit copyist 
notion of imitation which ultimately 
leaves play intellectually | functionless 
within his system. To illustrate: The 
process of accommodation is portrayed as 
establishing for the subject a photo- 
graphic-like negative of external reality. 
Imitation is said to be an extension of 
such negatives into positive action. “The 
function of imitation seems to be to pro- 
duce this set of ‘positives,’ which corre- 
spond to the ‘negatives’ of accommodation 
and which, at each new ‘printing,’ make 
new reconstitutions and anticipations pos- 
sible [1951, p. 84].” In due course, it is 
said, these imitations become interiorized 
as images, and then, later, when these 
images become attached to, and differen- 
tiated from, external symbols during in- 
telligent activity or during play, they are 
transformed into concepts. It follows 
that sustaining and initiating this whole 
process of representative activity are the 
encapsulated photographic negatives which 
occur during accommodation. It may be 
true to say, as Piaget does, that this ac- 
commodation is “an active copy and not 
a trace or a sensory residue of perceived 
objects [1960, p. 126; italics mine]," but 
accommodation is nonetheless conceived 
as a copy and given the initiating and sus- 
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aining role in his formulation of repre- 
h AB ctivity. i 
d s nce dbi that Piaget in- 
tends to say that, when the interiorized 
images become attached. to symbols in 
adaptive intelligent activity, this process 
of attachment changes the interiorized 
images. But whether or not this is the 
case, and this is not relevant here, it is 
certainly true that in Piaget's systematic 
accounts of play such an attachment of 
symbols is not said to bring about any 
such change in images. Piaget's defini- 
tion of play as predomination of assimila- 
tion over accommodation specifically pre- 
cludes any such change. Instead the 
encapsulated images derived from accom- 
modation determine the character of the 
activity. Play merely diversifies the 
symbols. As "reproductive assimilation," 
or "assimilation of reality to the ego," 
play can merely repeat; it can never 
originate. Wc thus have a situation in 
which the symbols of play are merely the 
reproductions of images preestablished 
through the copyist activity of imitatiori 
following accommodation. On this in- 
terpretation then, imitation is an essential 
factor in the constitution of representa- 
tive activity, whereas play is not. It has 
no essential role in the structure of in- 
tellect as conceived by Piaget. Intelli- 
gence cannot proceed without imitation. 
It can proceed without play. It is hardly 
necessary to emphasize that there are al. 
ternative points of view which grant to 
the processes of symbol formation a more 
active role in the constitution of thought 
than this copyist theory of Piaget's seems 
to imply (Werner & Kaplan, 1963), 
Greater internal cogency is given to the 
present critique, however, by showing that 
this implicit copyist notion of the origin 
of concepts leads to peculiarities within 
the system itself which are neither self- 
consistent nor consistent with the known 
data concerning play. 


al- 


Tue DISJUNCTION OF COGNITIVE 
AND AFFECTIVE FUNCTIONS 
Having deprived play of an active, in- 
tellective role in the constitution of new 
concepts, Piaget is forced into a series of 
substitute explanations. Unlike imitation 
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which has the one definition, play is pro- 
vided with a variety of definitions. At 
limes it is referred to as “reproductive 
assimilation" (1951, p. 87), as “general- 
izing assimilation" (1951, p. 100), but 
also as "pure assimilation" (1951, p. 87). 
"mere assimilation" (1951, p. 89), and 
"distorting assimilation" (1951, p. 102). 
The latter three terms do not occupy 
any systematic status within Piaget's 
theory of intelligence as do the first two. 
They are never defined, and it is not en- 
tirely clear just what they can mean, It 
is this lack of a functional role for play 
within the structure of intelligence which 
apparently forces Piaget to find alterna- 
tive nonintellective functional explana- 
tions for play, which he does not have to 
provide for imitation. It is said, for 
example, that play exists in childhood 


because thought is not adequate to its 
tasks: 


The prevalence of play among children is 
therefore to be explained not by specific 
causes peculiar to the realm of play, but by 
the fact that the characteristics of all be- 
haviors and all thought are less in equilibrium 
in the early stages of mental development 
than in the adult stage [1951, p. 147]. 


But why is there assimilation of reality to 
the ego instead of immediate assimilation of 
the universe to experience and logical 
thought? It is simply because in early 
childhood this thought has not yet been 
constructed, and during its development it 
is inadequate to supply the needs of 


: daily 
life [1951, p. 166]. 


. These quotations make it clear that play 
15 not considered as making any intrinsic 
contribution to thought, but is instead 
some sort of compensation for thought’s 
inadequacy. This interpretation is made 
explicit when Piaget says that the func- 
tion of 


ti play is to serve ego continuity. 
It follows that for the child ium 
of reality to the ego is a vital condition 
for continuity and development precisely 
because of the lack of equilibrium of this 
thought, and symbolic play satisfies this 
condition both as regard to signifier and 
signified [1951, p. 166].” But, surely, it 
is difficult to maintain that play is merely 
a polarity of thought, that the “prevalence 
of play . .. is therefore to be explained 
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not by . . . causes peculiar to the realm 
of play,” and then to invoke this concept 
of ego continuity which is not a concept 
intrinsic to Piaget's account of the nature 
of intelligence. This invocation of “ego 
continuity” does, in fact, suggest that play 
has some other function to serve which 
is quite distinct from thought. The 
notion of ego continuity as the function 
of play bears close resemblance to the 
concept of ego mastery as presented in 
psychoanalytic accounts, particularly that 
of Erikson (1950). But to rest play's 
positive contribution on such grounds is 
to lead to a disjunction of cognitive and 
affective functions, quite inconsistent with 
Piaget’s own repeated stress that the two 
are not separable. If play has a consti- 
tutive affective function of this sort (en- 
suring ego continuity). Piaget's general 
position on affective-cognitive relations 
requires that it should also have a con- 
stitutive intellective function. 


Tur DissUNCTION OF DIRECTED AND 


UNDIRECTED THINKING 


Tt would be wrong to imply that Pia- 
get's particular treatment of play stems 
perhaps from an accident in his episte- 
mology. The approach is more deep- 
seated than that for there is à consistency 
between his account of infant thought 
and his account of adult thought. Tt has 
been shown that in his treatment of in- 
e gives an initiating role to 
erive from accommodation 
to external reality. In adult thought also 
he is concerned with those coherent for- 
mal structures of intellect that have to do 
with accommodation to and control over 


the external world. In particular he is 
correspondence be- 


concerned with the p e 
tween "the structure? described by logic 
and the actual thought processes studied 


by psychology [1957, p. xvii]." He is 
not concerned with those less directed as- 
pects of adult thought usually referred 
to by such terms as reverie, creative 
imagination, O" divergent thinking. Yet 
in the opinion of some, the latter have à 
great deal to do with novel Lap ai 
adaptation. Tt would not be farfetche 
to speculate. in fact, that if there un 
intrinsic relationship between play 2n 


thought, it is more likely to be with these 
latter forms of divergent intellectual oper- 
ations than with the directed forms which 
concern Piaget. 

Piagets formulation of adult thought 
has a number of consequences which are 
consistent with the epistemological orien- 
tation already outlined. For example, 
his view that the thinking of the young 
child is relatively disequilibrial is "again 
similar to the empirical view which says 
that since thought copies reality, young 
children who are inexperienced will have 
more imperfect copies than older persons. 
Of course, stated in Piaget's terms, the 
young organism plays a more active role 
in acquiring its copies than was the case 
in the traditional empirical view, and un- 
like that view there is a telic quality to 
Piaget's formulation of the young child's 
They appear to be "'succes- 
sive approximations to a kind of ideal 
equilibrium or end state never completely 
achieved [Flavell, 1963, p. 47. The 
difficulty with this approach for Piaget's 
own point of view is that it is not con- 
sistent with his stage theory which says 
that each stage of thinking has a charac- 
teristic organization and completeness of 
its own. It is difficult to see how there 
can be both “equilibration” of early stages 


and the view that these early stages are 
relatively less in equi 


librium than later 
stages. The problem can 


operations. 


be located in 


the different usages of the term “equi- 
librium” which Piaget employs both to 
refer to characteristics of intellectual or- 


ganization at one time or stage and to 
refer to the ontogenetic status of the or- 
ganism in general. Apart from the se- 
mantic confusion in having such entities 
as *disequilibrial equilibriums," either 
stage or ontogenctic usage leads to diffi- 
culties with respect to the definition. of 


as a polarity of thought. 


play 
Tf it is said that when an intellectual 


structure is disrupted by intractable data 
the lack of intellectual resources in child- 
hood (the disequilibrial ontogenetic 
state) leads to the use of play as an af- 
fective holding action, this is again to 
give play no genuine cognitive function. 
Vet, if it said that play is a true polarity 
of thought in the sense of reestablishing 
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ilibrium through genuinely cognitive 
ama at any one stage (the implied stage 
theory), this is equally unfortunate E 
the theory as it stands. If, for example, 
some divergent thinking function is at- 
tributed to the “distorting assimilations 
of play, this requires a reconsideration of 
the role of divergent thinking not only 
in children’s thinking but also in the de- 
velopment of adult thinking. If play does 
not simply “distort” old concepts, what- 
ever such a phrase can mean, but also 
originates new concepts, then the whole 
account of the genesis of intellectual 
structures from childhood through adult- 
hood has to be changed. 

Piaget appears to have chosen rather to 
make a disjunction between the operations 
of thought in adulthood and the opera- 
tions of thought in childhood, in addition 
to the formal differences between sensory- 
motor, concrete, and abstract operations 
which are already a part of his theory. 
Thus it is said that childhood thinking is 
bolstered by the affective operations of 
play, but the thinking of adults is not. 
From Piagets viewpoint as undirected 
thinking, fantasy, play, etc., are specifically 
childlike and mainly compensatory, that 
is, having nothing to do with the develop- 
ment of particular kinds of intellectual 
operations, they may be confined to the 
infantile stage and regarded as irrelevant 


to the nature of adult intellectual oper- 
ations. 


THE ONTOGENESIS or PLAY 


It is legitimate to ask whether the evi- 
dence on play supports Piaget's point of 
view. His view that play is merely a 
buttress to an inadequate intelligence 
leads to the corollary that as that intelli- 
gence increases in efficiency and adequacy, 
play will cease to be important in the de- 
velopment of the mind. In Piaget's uni- 
linear account of the ontogenesis of 
intellectual structures, play has such a 
transient position. "Thus it is said that 
from about the age of 4 years onwards 
symbolic games decline because the child 

becomes capable of greater adaptive ac- 
tivity and is more realistic: “. . . progress 
in socialization instead of leading to an 
increase in symbolism transforms it more 
or less rapidly into an objective imitation 


of reality [1951, p. 139]." “Rule games 
. mark the decline of children's games 
and the transition to adult play, which 
ceases to be a vital function of the mind 
when the individual is socialized [1951, p. 
168]. On the basis of the evidence 
about play several objections can be raised 
to the view that the symbolic games of 
childhood are simply replaced by more 
realism with age, and that play which is 
"vital" in childhood ceases to be vital 
in adulthood. In general it can be main- 
tained with equal cogency that, rather 
than a decrease in the symbolic play 
function with age, what we actually find 
is a shift in the applications and thc dif- 
ferentiation of this function. First, the 
early rule games of children continue to 
be heavily loaded with symbolic elements 
which play a part in determining out- 
comes equal in importance to skill. In the 
host of singing games of young girls, 
for example, it is the one who is arbi- 
trarily chosen as Punchinello or as Far- 
mer-in-the-Dell who determines the out- 
come. Similarly, in the great group of 
central-person games of early childhood 
which model the family drama, it is the 
player chosen by the arbitrary counting- 
out processes or the player with the role 
vestments of Caesar or Red Rover who 
decides by fiat when the game will start, 
when the players will run, etc. (Sutton- 
Smith, 1959), Here the play symbolism 
has been collectivized, but it has hardly 
been decreased, or given way to greater 
realism. Tn Piaget's (1932) treatment of 
games he was concerned with a game 
(marbles) in which the symbolism was 
much less Pronounced, marbles being 
more age appropriate at the end of early 
childhood, around -10 years, whereas 
fiese other games just mentioned tend to 
more in ne ough ens. Farther 
——— ise, play at games was 
used to illustrate moral and social de- 
velopment rather than to provide the basis 
or an understanding of the play function. 
In a sense, Piaget (1932) first uses games 
to illustrate the development of morality, 
while he later (1951) uses play to illus- 
trate the development of thought. Neither 
time does he deal systematically with the 
peculiar functions of games or play in 
human development. Perhaps it is this 


$ 
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tendentiousness of approach that leads 
Piaget to discount the continuing influ- 
ence of play in psychological development. 
Even when children's games are mainly 
matters of physical or intellectual skill, 
as from 10 years onwards, however, it is 
difficult to accept Piaget's view that this 
is simply a matter of play being converted 
from a symbolic to a realistic application. 
The total game performance still remains 
basically a symbolic procedure. There is, 
after all, little that is precisely rational 
about a game of football or a game of 
hasketball. In fact, there is support for 
the point of view that such sports are best 
conceived as modern ritualized dramas 
of success and failure (Sutton-Smith, 
Roberts, & Kozelka, 1963) and as such 
have little that is either rational or real 
about them. Furthermore, it is the thesis 
of Huizinga (1949) that play (if not 
sport) does continue to be a vital func- 
tion of mind even in socialized adults and 
that it permeates all their vital activities 
including those of a cultural sort and 
those of an economic and political nature. 
Recent cross-cultural work showing the 
full implication of games in culture woul 
appear to substantiate this view (Roberts, 
Arth, & Bush, 1959; Roberts & Sutton- 
Smith, 1962; Sutton-Smith & Roberts, 
1964). Though preschool egocentric sym- 
bolic play may decrease with age, play 
nevertheless finds expression in the midst 
of a variety of other cultural and social 
forms. It is thus not displaced by real- 
ism or by greater rationality, nor does it 
cease to be a vital function with age. In- 
stead it becomes more differentiated an 
more representative in its contents of the 
other forms of human „development. 
Without such a point of view it 1S dif- 
ficult to understand the verbal play of 
adults, their social an 
rituals and their carniva 
and fairs, and their widespread 
versified playfulness. A n 
This present criticism. of Piagets view 
f play as 4 transient, infantile stage 1m 
one rgence of thought parallels Vygot- 
the emerg "neism of Piaget’s notion 
i eech as à transient, in- 
o! F apante the emergence of social- 
otsky maintained that 


ized Speech a of ontogeny egocentric 


in the cours 
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speech does not give way to socializ 
speech, but becomes Ker ME. ey 
ner speech which may itself be didieen- 
tiated according to autistic or logical 
ends. Similarly, it has been ynaintilied 
here that children's play does not give 
way to intelligent adaptation, but becomes 
differentiated in a variety of ways. The 
difference between Vygotsky's criticism 
and the present one, however, hinges on 
the fact that both Piaget and Vygotsky 
are talking about the ontogenesis of the 
same function—speech—whereas here two 
functions are involved, intelligence and 
play, which Piaget attempts to amalga- 
mate into a single function (intelligence) 
and an aborted variant (play). It should 
be noted in conclusion that Piaget's vari- 
ous earlier treatments of undirected think- 
ing in children, their animism, artificial- 
ism, etc., aS transient, infantile stages in 
development are susceptible to similar 
criticism (Piaget, 1928, 1929, 1930). 


CONCLUSION 


ended that Piaget's 
directed rather than 
1 operations together 


Tt has been cont 
major concern with 


undirected intellectua 1 
with his implicit copyist epistemology 


have made it impossible for him to deal 
consistently with play. He has attempted 
to make it a function of thought without 
giving it any intellective function within 
thought. And he has been unable to give 
it such a function within thought because 
in his epistemology concepts are ulti- 
mately copies derived from and appropri- 
ate to an external reality. As play does 
not copy but “distorts” reality, it can have 
no intrinsic place in such an epistemology. 
Asa result, Piaget js forced finally to ex- 
plain play in terms of the type of affective 
and conative functions that are familiar 
in other play theories. (This is not to 
imply that there are not parallels between 
cognitive operations and play contents at 
given ages. It is the virtue of Piaget's 
observations that in this respect they 
complement those of psychoanalysts and 
sociologists which tend to be highly se- 
lective in terms of play content reflecting 
their own characteristic theoretical ori- 
entations. But the existence of such par- 
allels between the operations of thought 
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and the structures of play CONVEYS nrothing 
in itself about the function of play.) mn 
While it has not been the ciae - 
this paper to propose any alter peo 
retical approach to play or to cuita 
Piagets play theory from eje 
oints of view, a brief mention of an al- 
r A epistemology and the quite dif- 
ferent approach to play which. it entails 
may serve to highlight. the legitimacy of 
the present criticism in epistemological 
terms. Ernst Cassirer (1953) explicitly 
opposes the view that concepts are copies 
of external reality. They are rather 
modes through which reality is con- 
structed on a level transcending the simple 
sign functioning of lower organisms. 
From an epistemologically constructivist 
viewpoint of this sort play might well be 
taken as a very positive illustration of the 
thinking process because it involves the 
construction of symbols to create new con- 
ceptual domains. Though from Cassirer's 
point of view, play would have to be just 
one of many such symbolic modes with its 
own characteristic properties, others being 
art, myth, science, etc., for Cassirer's 
point of view with respect to symbolic 
development is not only constructionist, 
it is also multilinear. Piaget with his 
contrasting copyist assumption and his 
unilinear notion of thought is forced to 
reduce all forms of symbolic activity to 
the single copyist form of adaptive 
thought. And on the evidence of the in- 
ternal inconsistencies to which this leads, 
his approach does not appear to have 
been fruitful for the explanation of play. 
The present criticism of Piaget's theo- 
retical system which focuses on the in- 
adequacy of his epistemology when ap- 
plied to play probably has relevance for 
the interpretation of divergent-thinking 
activities in general, such as creativity, 
originality, expressiveness, etc. At the 
same time this criticism probably has less 
relevance to such convergent-thinking 
operations as the understanding of physi- 
cal causality, spatial relations, etc., with 
which Piaget is mainly concerned and for 
which his copyist assumption may be 
more intrinsically relevant. For whatever 
may be said here about Piaget's theory of 
play, there is no intention to detract from 


the conceptual flexibility of his system 
when applied to the child's understanding 
of the operations of the physical world. 
In that area man and mollusk share a 
common concern, and Piaget's episte- 
mology appears to be revealing. 
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. The interesting criticism that. Sutton- 
Smith has made of my play theory calls, 1 
feel, for a brief reply. I must confess 
that I have some difficulty in recognizing 
the opinions that this author attributes to 
me, and I think that his formulations of 
them derive from the fact that he has only 
been able to assess that portion of my 
work that has been translated into 
English. 

I have never, either implicitly or ex- 
plicitly, expressed the view that knowl- 
edge is a copy of reality because such a 
view is contrary to my position with re- 
spect to the nature of intelligence. My 
whole conception of intellectual operations 
is based on the premise that to know 
or to understand is to transform reality 
and to assimilate it to schemes of trans- 
formation. In particular, I have never 
said that representative or symbolic 
thought, including concept formation, is 
derived from imitation (in the form of an 
accommodative copy). On the contrary, 
in my view, all concepts are derived first 
irom the action and then from the oper- 
ation, which is another way of saying 
that concepts are the expression of an 
assimilation by schemes of transforma- 
tion. 
In fact, the reality that intelligence 
tries to grasp consists of à series of 
states (A,B,C, etc.) and transformations 
which modify these states (A into B, B 
into C, etc.). One can, therefore, dis- 
tinguish two components of cognitive 


functions: 
There is first of all a figural component 


that does not itself constitute a CODY but 
rather a more or less approximate de- 
scription of reality states and their con- 
figurations. This figurative „component 
is derived from perception, imitation, and 
imagery (graphic or mental) or from 1m- 
teriorized imitations. 

There is secondly 
ponent which takes accout 


a cognitive com- 
at of transforma- 


tions and wW hich builds upon sensorimotor 
cape interiorized actions, and finally, 
Su ds tat nal i Lii EEE, 
In other words bur aon Mpeg 
. ords, n studying percep- 
tion, mental imagery, etc. (research not 
yet published in English), I have tried to 
show that the figural aspects of cognitive 
functions are never sufficient to explain 
representative or conceptual knowledge. 
Indeed, the figurative components only 
play a useful role to the extent that they 
are subordinated to the cognitive com- 
ponent. It seems clear to me that knowl- 
edge of a “state” cannot be reduced to a 
copy of this state, but necessarily consists 
in an assimilation of this state to a pre- 
operational scheme or @ postoperational 
concept. In fact, to understand a state 
one must understand the transformations 
irom which the state results which, let us 
repeat, excludes all types of copy knowl- 
edge. 
The role of play in this system seems 
Play is an exercise of action 
efore part of the cogni- 
onception. At the 


clear. 
schemes and ther 
tive component of c 
same time, however, play manifests the 
peculiarity of a primacy of assimilation 
over accommodation which permits it to 
transform reality in its own manner 
without submitting that transformation 
to the criterion of objective fact. Re- 
gardless of what Sutton-Smith says, 
fits into this system without becom- 


play 
ing subordinated to accomodative imita- 
tion. Imitation only plays the role of a 


symbolic instrument from the moment 
that sensorimotor play becomes symbolic. 
This last point leads us to an examination 
of the symbolic function. 

The sensorimotor functions (including 
sensorimotor play and perception) are not 
symbolic inasmuch as they only use undif- 
ferentiated signifiers (indexes or signals). 
The symbolic function begins around 14-2 


years of age with the appearance of dif- 
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ferentiated signifiers (signs and qe 
and consists in representing inem E 
urative and cognitive schemes by ari s 
f these signifiers. In other words, the 
2 bolic signifiers are derived from imi- 
pe which, before becoming interior- 
ized, is already a kind of symbolization in 
action. This fact, however, in no way 
implies that symbolic instruments should 
be confused with figural aspects of 
thought. Perception is figurative, but not 
symbolic, while language is symbolic (in 
the broad sense), but not figurative. In- 
teriorized imitation and images are, on 
the contrary, at the same time figurative 
and symbolic. Mental imagery in par- 
ticular is the product of interiorized imi- 
tation and not the simple residue of per- 
ception as it was previously believed. 

If symbolic play uses imitation, it is 
exclusively as a symbolic instrument. 
This follows because there are only two 
ways that an absent situation can be rep- 
resented; it can either be described by 
language or evoked by imitative gestures 
or images. This in no way means, how- 
ever, that symbolic play can be reduced 
to imitation since play is exclusively an 
assimilation of reality to the self. None- 
theless, since it is symbolic it needs signi- 
fiers, and it borrows them either from 
language or from the only other source of 
symbols, that is to say, gestural or in- 
teriorized imitation. 

It seems clear now that the criticisms 
that Sutton-Smith has made of our theory 


THEORETICAL NOTES 


are unfortunately the result of a series of 
misunderstandings and that the opinions 
that he attributed to us, which indeed de- 
serve to be criticized, are fortunately not 
our views. There remains, however, one 
other point that must be clarified. The 
author defends the idea that symbolic play 
does not diminish but rather differentiates 
during the course of development. This 
is certainly true in one sense of the word. 
In becoming differentiated, however, it at 
the same time becomes more and more 
adequately adapted to reality (construc- 
tion games, etc.). This is the sense in 
which I speak of play as diminishing 
with age. And it does diminish if one 
takes, as I do, the essential property of 
play to be the deformation and subordi- 
nation of reality to the desires of the self. 

Finally, Sutton-Smith makes an allu- 
Sion to Vygotsky's conception of ego- 
centric language. I have already replied 
in length to Vygotsky's insightful re- 
marks. However, if we are to take up 
the problem of egocentrism again, which 
goes hand in hand with the deforming 
processes of assimilation, we must also 
situate language and play in the overall 
context of the individual and social ac- 
tions of the child. In studying the col- 
laboration of children in collective ac- 
tions, R. F. Nielsen has found an evo- 
lutionary law which resembles to a great 
extent what I used to call the passage 
from egocentrism to cooperation, 
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Careful study of the implications 
relating drive to stimu 
predictions result from changing 


tions. 


of stimulus generalization (GSGs) 
However, 
GSGs with increased or decreased drive, and all 
r raised or lowered GSGs will result from 
depending upon boundary conditions. 


GSGs with decreased drive. 
cither flatter or steeper 
theories predict that eithe! 
increased or decreased drive, 
The boundary 


sidered in making rigorous deductions ar 
f stimulus sampling probability or energizing 


relative predominance o! 
effects of drive as compar 
and final levels of drive, streng 


of the threshold. 
ables, clear predictions from the t 


Relationships between drive and 
stimulus generalization have been 


central considerations in theoretical 


formulations and investigations deal- 
reas, such as 


ing with a variety of a i 
approach and avoidance behavior 
(Brown, 1942, 1948), generalization of 


operant responses (Jenkins, Pascal, & 


Walker, 1958; Thomas & King, 1959), 
displacement (Miller & Murray, 1952), 
and schizophrenic behavior (Broen, 


Storms, & Goldberg, 1963; Mednick, 
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of Estes’ theory and D X H theories 
lus generalization reveals that highly varied 


the specification of boundary condi- 
are flatter gradients 
with increased drive and steeper 
the D X H theories predict 


which must be con- 


e described. These include the 


fects, initial 
and height 
es in measuring these vari- 
Il often be impossible. 


1958; Rodnick & Garmezy, 1957). In 
spite of the considerable theoretical 
importance and experimental focus 
on relationships between drive and 
gradients of stimulus generalization 
(GSGs), conclusions regarding these 
relationships have been varied and 
disparate. Increased drive has been 
credited with such effects as increasing 
elevations of GSGs (Mednick & 
Freedman, 1960), increasing (sharp- 
ening) their slopes (Spence, 1958), 
flattening GSGs (Estes, 1958; Jenkins 
et al., 1958), and sharpening and 
flattening GSGs as well as raising 
and lowering them (Thomas & King, 
1959). Estes stimulus sampling 
theory and D X H energizing theories 


critical reading ese Mn 
article and many helpful comments anc o 1 
Irwin Levin for computational aid. such as those of Hull and Spence yield 
à 
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different predictis i ege. Sow 
effects of drive chang diü 
partially depending upon conditions 
her than drive conditions (Estes, 
1038, 1959b ; Hull, 1943, 1952 ; Spence, 
ry 1958). Itis not at all clear that 
the apparent inconsistencies in the 
evidence are inconsistent with any 
of these theories since quite different 
results might be predicted for differing 
experimental conditions. It is the 
purpose of this article to make explicit 
the implications of these theories for 
relationships between drive and stim- 
ulus generalization and to discuss the 
variables and conditions relevant to 
designing appropriate experimental 
tests. 
Our interest will focus on the effects 
of drive on GSGs where a response 
has been trained to a stimulus or class 
of stimuli, and tests are made with the 
training stimulus and other stimuli on 
the stimulus continuum under various 
drive conditions. Thus, the effects of 
drive on discrimination learning and 
tests involving simultaneous presenta- 
tion of more than one stimulus will 
not be considered. Although one 
response is trained to a single stimulus 
situation, all theories under considera- 
tion acknowledge that other responses 
may possess some probability of oc- 
currence, or some strength (except, 
according to Estes, when some re- 
sponse has a unit probability of occur- 
rence). Therefore, each stimulus will 
be considered associated with more 
than one possible response. 


ESTES’ THEORY 


In Estes’ (1959b) theory, on any 
trial only a sample of the set of 
stimulus (S) elements of which a 
stimulus is composed act on the re- 
ceptors of an organism. Whenever a 
given S element is sampled just prior 
to a reinforced response, that S 
element becomes conditioned to the 


response. Conditioning is at full 


strength after one such trial. At any - 


time, an S element can be conditioned 
to no more than one response. S 
elements available for sampling are 
sampled randomly according to their 
sampling probabilities. The proba- 
bility of a response is equal to 
the proportion of the available S ele- 
ments which are conditioned to the 
response, with all elements weighted 
by their sampling probabilities. Re- 
sponse probability can be increased 
by adding conditioning trials so that 
more S elements become conditioned 
to the response, by increasing the 
proportion of available S elements 
conditioned to the response (e.g., 
bringing in an additional stimulus 
previously established as a condi- 
tioned stimulus for the response), or 
by increasing the sampling probabili- 
ties of the S elements. Response 
probability can be decreased by de- 
creasing the proportion of available S 
elements conditioned to the response, 
or decreasing the sampling probabili- 


ties of some or all of the S elements 
conditioned to the response. 


Drive Effects in Estes’ Theory 


Although Estes (1959b) does not 
discuss drive in his formulations of his 
general theory, it is well established 
that drive changes have stimulus 
effects. Thus Estes (1958) offers a 
separate, straightforward interpreta- 
tion of drive effects in terms of his 
stimulus sampling theory. Basically, 
drive changes result in altered re- 
sponse probabilities because of changes 
in the drive stimuli available for 
sampling and changes in the sampling 
probabilities of drive stimuli. 

Intended to apply only to hunger 
and thirst, Estes’ formulation of drive 
effects involves four classes of cues: 
satiation cues (F), such as stomach 
distension ; deprivation cues (D), such 
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TRAINING 
(Low Drive) 


Bre m 
(Deprivation Cues) 4 


T 
ge 
(Satiation Cues) L 
En 
- 
(Training Stimulus) 


X 


(Extraneous Cues) 


Fic. 1. Classes of cues in 
of drive changes. (The width o 
represents the average sampling 


increase is depicted.) 


ions or dryness of 
trolled or manip- 
rimenter (the 
, such as pitch or loud- 
d extraneous Cues 


as stomach contract: 


ness of a tone; an 
(X), such as noises 
i class of 
er of such cues 
i i the average 
available for sampling and 
sampling probability of those cues. 
The theoretical probabi io a t 
response is equal to t ne pro 
Pi classes of cues conditioned to AE 
response (weighted by the appropri 


meh 
sampling probabilities) . 
weight "of all cues available for 


sampling. š 

Figure 1 illustrates the changes 
associated with a drive increase be- 
tween training and testing and pre- 
sents the notation to be used in 
further discussion. D and F are the 
sampling probabilities of deprivation 
and satiation cues, respectively, be- 
fore drive is changed, Ni and N. are 
the numbers of deprivation and 
satiation cues available for sampling. 
D' and F' are the sampling probabili- 


ties of deprivation and satiation cues 


Estes’ drive theor 
f a rectangle represents the number of cues, while the height 


probability of those cues. 


y and notation used in discussion of effects 


For illustrative purposes, a drive 


after the drive increase. Changes in 
the sampling probabilities are repre- 
sented by AD and AF. Mı and M» 
are the numbers of deprivation and 
satiation cues available for sampling 
which were not present during train- 
ing. Sand X are the total weights of 
the experimental and the extraneous 
cues. In Figure 1, the white areas 
represent the weights of cues condi- 
tioned to the response, while cross- 
hatched areas represent the weights of 
cues not conditioned to the response. 

Assumptions Concerning the Effects 
of Drive. A few assumptions regard- 
ing the effects of drive changes on 
stimuli available for sampling and 
their sampling probabilities make 
possible rigorous deductions concern- 
ing the theoretical effects of drive 
changes on GSGs. 


1. It is assumed that the average 
sampling probability of deprivation 
cues (D) increases monotonically and 
that of satiation cues (F) decreases 
onotonically with increased drive. 
2. The deprivation and satiation 
s available for sampling are as- 


m 


cue 
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sumed to vary systematically with 
drive changes. If testing drive of 
different from training drive, some o 
both classes of cues which were avail- 
able during training will be replaced 
by cues which were not available for 
sampling. 'To simplify deductions, it 
is assumed that the number of de- 
avati cues and the number of 
satiation cues available for sampling 
remain constant. 

3. The combined weight of depriva- 
tion and satiation cues increases with 
increased drive and decreases with de- 
creased drive, that is, N AD + NAF 
is positive for increased drive, nega- 
tive for decreased drive. 


The following formulas follow di- 
rectly from the above assumptions, 


which seem to underlie Estes’ (1958) 
presentation. 


Let G denote overlap with the 


Pai sn (Ni — M)D 
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training stimulus, with G = 1 denot- 
ing the training stimulus, G = 0 
denoting a stimulus so remote that 
it has no overlap with the training 
stimulus, and 0< G< 1 denoting 
intermediate degrees of overlap. GS 
is the overlap of generalization stimuli 
with the training S. If, asin Figure 1, 
training has been carried to asymptote 
so that all stimulus elements except 
extraneous cues are conditioned to the 
responses, then the response proba- 
bility at the end of training for any 
stimulus on the S continuum is: 


NıD + NoF + GS " 
NMDYNFESLX M 


Prrsin "e 


When testing is carried out under a 
different drive from that present 
during training, response probability 
1$ given by: 


+ (N: - Mr 


ND' + MF + SLX 
In order to predict the effect of a drive chan 
on a GSG, we require a formula for Pie. = 


Pres can be written: 


[MD + NoF + GS].+ [NAD + NAF — 


+ GS 


[2] 
ge between training and testing 
Psstas 


Then 
Pigi idi P train 


[MD + ISF +S +X] + [NAD 


MiD' — M;F!'] 
+ NAF] ` 


[MAD + NAF — MD! — M,F'ENSD + N:F +S + X] 


— [MAD + NSAF]EN 


~ [ND + NaF + S+X]+ [NAD + NAF] 
By expanding and grouping terms and letting T 


we arrive at: 


Pus Pain xp 


[S + X — GS]LN,AD + NSAF] — EM, 


iD + NF + GS] 
[ND + MN. SHXT 


=MD+NF+ 5 TX, 


[T+ NAD + N.AF]T 


From Equation 3 a number of 
predictions can be made concerning 
the effects of drive changes on 
GSGs. Drive decreases will be con- 
sidered separately from drive in- 


+ GFP 


creases. 
upon 
they 


Where predictions depend 
assumptions within the theory, 
will be stated in the context of 
the relevant boundary conditions. : 
Lower Testing Drive than. Training 
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Fic. 2. Predicted effects of decreased drive on GSGs (left) and several possible effects 
of increased drive based on Estes’ theory (right). 


Drive. In this case N;AD + N2AF is 
negative by assumption. The follow- 
ing predictions are derived from the 
formula for Prost — Pain 

1. The denominator in Equation 
3 is always positive, since T is the 
total weight of all stimuli in the 
training situation, and the addition of 
N,AD + NsAF to T yields the total 
weight of all stimuli in the test 
situation. Since S + X — GS is also 
positive while N;AD + N.AF is nega- 
tive, and the positive expression, 
(MD! + M:F')T, is preceded by a 
s sign, the value of Pus — Perain 
Thus, the gra- 
1 at all GS. 


verlap of test 
r the value of 


minu 
is always negative. 
dient is lower at S anc 

2. The less the o 
stimuli with S, the lowe f 
G and the higher the value of S TX 
— GS. Thus, at more remote stimuli, 
Prost — Pirain is a larger negative value 
than at the training stimulus. Hence, 
reduced drive steepens the GSG. 

The lower and steeper gradients 
resulting from reducing drive between 
training and testing arefillustrated in 


Figure 2. 

Greater 
Drive. By assumpue 
NiAD + N:AF is posi 
Several predictions follow: : 
iven test stimulus, Pe 
E pha upon the relative 


Testing Drive than T! raining 
tion, the value of 
tive in this case. 


weight of NAD + NsAF as compared 
to MiD' + MsF'; that is, essentially 
the change in sampling probabilities 
relative to the change in cues available 
for sampling. These two factors work 
in opposite directions with respect to 
the elevation of the GSG, NiAD 
+ NAF tending to raise the GSG, 
MD! + M2F’ tending to lower it. 
Thus, the GSG can be either raised 
or lowered by increased drive depend- 
ing upon the values specified for these 
variables. 

2. In Equation 3, for a given drive 
change, all quantities are constant 
over variations in the experimental 
stimulus with the exception of GS. 
Since N;AD + NSAF is a positive 
value for increased drive, and S+ X 
— GS becomes a larger positive figure 
as G decreases, Prest — Ptrain is greater 
at more remote stimuli than at the 
training S. Thus, drive increases lead 
to flatter GSGs. 

3. If Prost — Pernin is negative at the 
training .S, at remote stimuli Prest 
— Prin may be positive since 5 + bas 
— GS is greater at remote stimuli. 
Thus, increased drive may decrease 
response probability at some stimuli 
and simultaneously increase resporise 
probability at other stimuli with less 
overlap with the training S. This 
"crossover" is more likely to occur if 
the major effect of increased drive 
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consists of changes in sampling proba- 
pilities (an effect represented in 
Equation 3 by NAD + NAF), and 
it is unlikely to occur if the major 
effect of increased drive is through 
changes in the drive cues available 
for sampling (represented by M,D' 
+ MoF’). Again the prediction de- 
pends upon assumptions regarding 
the influence of N,AD + NAF rela- 
tive to MiD' + MoI’. 

Figure 2 illustrates several of the 
possible effects of increased drive 
including higher and flatter GSGs, 
flatter GSGs which cross the GSG at 
the training drive, and lower and 
flatter GSGs which do not cross the 
GSG at the training drive. Not all 
possible predictions are represented ; 
for example, the noncrossover GSG 
below the GSG at the training drive 
could be higher at the training stim- 
ulus than the GSG which crossed. 

A More General Formula. It will 
be recalled that in Figure 1 and in the 
formulas upon which the above pre- 


Pes — Pain 


dictions were based it was assumed ` 


that training had been carried to. 


asymptote so that all but extraneous 
cues had been conditioned to the re- 
sponse. More general predictions 
which include the many situations in 
which training has not been carried 
to completion would be of interest. 
To describe these more general 
conditions, some additional notation 
is needed. Let N, and S be the 
number and sampling probability of 
experimental cues and Li, Ls, and Ls 
be the numbers of deprivation, satia- 
tion, and experimental cues, respec- 
tively, which were not conditioned to 
the response at the end of training. 
M; and M; represent the numbers of 
drive and satiation cues which were 
conditioned to the response at the 
end of training, but were replaced by 
unconditioned cues as a result of a 
change in drive before testing. The 
formula for the change in gradients 
resulting from a change in drive is: 


(N:S + X + LD + LiF — (Na — L3)GS]LNiAD + NAF] 


K — TULAD + LAF] — TLMiD' + M:F'] 


LT + NiAD + NsAF]T - [4] 


To some extent, the predictions 
from this formula differ from those 
stated above. However, it is again 
predicted that decreased drive will 
produce steeper GSGs, and increased 
drive, flatter GSGs. The steepening 
or flattening will be less marked the 
larger L; is relative to Ns. 

Using this more general formula, 
the predictions concerning whether 
increased drive will raise or lower 
GSGs remain essentially the same as 
in Figure 2. The situation with 

respect to decreased drive is more 
complex and depends on assumptions 
about the different rates of change in 


the variables involved. Decreased 
drive may raise the GSG under some 
conditions rather than lower it. These 
conditions depend upon the values of 
Ni, No, Ns, Li, Lo, La, Mi, Ms, etc., 
as specified in Formula 4.? 


2For example, if the relative weight of 
deprivation and satiation cues conditioned to 
the response is initially less than the relative 
weight of experimental and extraneous cues ' 
conditioned to the response, and if there is no 
change in drive cues available for sampling 
and no change in the weight of drive cues 
conditioned to the response relative to the 
weight of all drive cues, the probability of the 
response will increase with decreased drive. 
TThis is because the total weight of drive cues 


d 


If as a result of reduced drive the 
. GSG is raised at the training stimulus, 
a crossover effect would be predicted 
for larger absolute values of NiD 
+ NF relative to MiD' + MoF’. 


= e 
THE RESPONSE ACTIVATING OR 
ENERGIZING ACCOUNTS 
or DRIVE 


In this section we will consider a 
family of theoretical accounts which 
interprets the effect of drive on stim- 
ulus generalization primarily in terms 
of a multiplicative effect of drive on 
the habit strength of a response (e.g., 
Broen et al., 1963; Hull, 1952, 
Ch. 8; Rosenbaum, 1951, 1953; 
Spence, 1958). Although these ac- 
counts are based on the Hullian treat- 
ment of drive as a response energizer, 
none conforms strictly to the Hull- 
ian postulate system, which includes 
M ema regarding the effects of 
SOR (QUE in drive-stimulus intensity on 
mM strength (Hull, 1952, Corol- 
ae 2). Hull was dissatisfied with 

reatment of stimulus intensity 
(1952, pp. 89, 220) and chose to 
illustrate the effect of changed drive 
ona gradient of stimulus generaliza- 
tion with computations based solely 
on the energizing aspect of drive 
(p. 225). The other writers cited 
above have followed suit. In the 
family of accounts discussed here the 
Possible effects of the stimulus aspect 
of drive are not denied. However, 
Since these accounts relate SG only 
to the energizing aspect of drive, they 
imply that the major effects of drive 
on SG can be accounted for in terms 
of the energizing function of drive. 


decreases as drive decreases (see Assumption 
However, if the total weight of drive 
cues decreases very slightly and the number 
of deprivation and satiation cues available 
or sampling changes substantially, the 
Probability of the response will decrease wit 
€creased drive. 
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The accounts of the relationship 
between drive and SG which em- 
phasize the energizing aspect of drive 
are based on, or seem to imply, the 
following theoretical specifications: 


1. When a response has been 
trained to a stimulus, the habit 
strength (H) of the response will gen- 
eralize to other stimuli, with the 
generalized H being a monotonic de- 
creasing function of stimulus dis- 
similarity. 

2. The same stimulus may evoke 
more than one habitual tendency to 
respond (e.g., Hi, a dominant re- 
sponse, and H», a competing response). 

3. At any stimulus, drive (D) 
multiplies the habit strength of all 
tendencies to respond elicited by that 
stimulus. 

4. With other variables held con- 
stant, reaction potential (E) is the 
product of D X H, and thus E: — Es 
= D(H, — Hj. E oscillates in 
strength from moment to moment, 
the momentary Es being distributed 
approximately normally. Competing 
responses have equal standard devia- 
tions of momentary Zs resulting from 
oscillation, and competing Es oscil- 
late asynchronously. Despite varying 
statements regarding oscillation in the 
different accounts, the above state- 
ment reflects computing practices 
which have been quite consistent. 

5. In order for a response to occur, 
at the time of testing its momentary 
E must be stronger than the momen- 
tary Es of competing responses, and 
must be stronger than a threshold 
value which is constant and above 


Zero. 


Energizing Accounts and GSGs 

In discussing implications of the 
above account, the probability of a 
response (Pr) will be used as the 


dependent variable because of ease 


and directness of derivation from the 
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RESPONSE PROBABILITY 
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o 


Fic. 3. 


Some hypothetical 
different drive levels in a situation without 
competing responses computed according to 


GSGs at 


Spence’s theory. (Time-limited trials which 
are brief enough to rule out within-tria] Æ 
variation are assumed. Longer trials would 
give higher response probabilities, Under the 
lowest drive level illustrated, D, E-Lz-,.5 
at Sr and .1 at Sy. Threshold = 1.0, These 
values are in terms of the standard deviation 
of momentary E; E minus oscillation.) 


energizing account and comparability 
with derivations from Estes’ account. 


one in which a 
response has been trained to the 
stimulus Sp following which observa- 
tions are made on the first test trial 
at either Sp or Separate presentation 
of a generalization stimulus, S,. 
Thus, the Tesponse probabilities in 
Figure 3 are simply the proportions 
of the distribution of momentary Es 
Which are above threshold at a specific 
level of drive. 
Consistent with the 
illustrated in Figure 3, 


implications 
most writers 
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using this account (Dollard & Miller, 
1950, p. 53; Hull, 1952, p. 225; Knopf 
& Fager, 1959; Mednick, 1958; Rosen- 


baum, 1953; Spence, 1958) have 
stated that at higher levels of D, 
GSGs will be elevated (increased 


probability of R). Steeper GSGs 
under high D than low D (see Figure 
3—D, 2D, are also expected 
(Rosenbaum, 1953, p. 35; Spence, 
1958, Dp. 83, 84), although it is also 
noted that this GSG Steepening may 
not be observed When certain de- 
pendent variables are used. 
pected i 
higher 
are near their Upper limits 


been noted (Rosenbaum, 
). 


Competitional Situations, 
stimuli evoke competing reg 
and the Æ distribution of R, 
tirely above threshold, the predi 
are substantially the same as in the 
noncompeting situation r : 
in Figure 3, Higher p ide eae 
and steeper GSGs, With fl 


When 
Ponses 
is en- 
Ctions 


some Steepening will 
ifference 


is that at Temote stimuli wh 
competing R js dominant, Pr, will 
be reduced by drive increases and 
raised by drive decreases, 


When a portion of the E distribu- 


Which are considerably 
plicated. 


x 
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and SG which are represented in 


‘Figure 3. Since the predictions suit- 


able for a specific situation depend 
upon the boundary conditions, it is 
imperative that predictions and con- 
clusions be presented in the context 
of the relevant boundary conditions 
as we shall attempt to do. 
The experimental situation we will 
use to illustrate these varied effects is 
the same as the situation represented 
in Figure 3 with two important differ- 
ences, First, presentation of either 
Sr or S, is assumed to evoke both Ri 
and, at lesser H, a response (Rs) which 
IS Incompatible with Ri. The com- 
peting response may be the result of 
any of a number of factors, such as 
Previous training to Sp or Sa or 
generalization from another stimulus. 
econd, trials are ended only by the 
Occurrence of either Ry or Re. The 
UE implications of the energizing 
z i in this situation are illustrated 
lel it 4. The computations were 
Sine on Spence's (1956, Appendix A) 
od for computing response proba- 


Vy in a situation with two alternate 


r 
“sponses. GSGs have been com- 
" (erent aa 
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Š 7o f 
a 20 
ec 
a 
60 : 
50 T : 
Sr Sa 


Fic. 4. Some hypothetical GSGs at different 
and R;) alternate responses. (Computations assume th 
Under the lowest drive level illustrate 
At left, E, — L = .1, and E, — Es = .1 at Sa under 
Threshold = 1.0.) 


rence of R, or Re. 
at Sr. 
E, — E: = .1 at Sa under D. 


puted for one level of drive (repre- 
sented as D) and higher levels of 
drive, each twice that of the next 
lower level (2D, 4D, 8D). 

As Figure 4 illustrates, when a com- 
peting R is present, either increased 
or decreased drive may elevate the 
GSG while also sharpening or flatten- 
ing the GSG, or it may lower the GSG 
with concurrent sharpening or flat- 
tening. Two additional possibilities 
not shown in Figure 4 should also be 
noted. Increased D may raise the 
GSG at Sr while lowering it at 
a generalization stimulus (e.g., if 
H, < He at Sa). Alternatively, it is 
possible to find conditions where a 
decrement in Pp, at Sr and an increase 
in Pr, at Sa would be expected 
under increased D—for example, when 
L=1, E — L= 1.5, and E; — E; 
= 1.5 at Sr, while at Ss, E1— L 
= 1.2, and E, — E; = .2. (When D 
is doubled, these conditions would 
lead to a very small R, decrement at 
Sr and facilitation at Sa.) 

Dependence upon Boundary Condi- 
Hons. Since such varied and contra- 
dictory results are all theoretically 


100: 
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drive levels in a situation with two (R, 
hat trials are ended only by the occur- 
d, D, £1 — L = .5, and E, — By = .5 
D. Atright, E, — L = .2, and 
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ible, it would seem that in order 
cum on energizing account to be most 
useful or testable, predictions from 
the account should be made in con- 
junction with statements of boundary 
conditions which would make the 
predictions theoretically necessary. 
This is especially true when competing 
responses are evoked in a situation or 
where predictions are made about 
GSG slope. Predictions for GSG 
elevation in noncompetitional situa- 
tions are monotonic (e.g., lowered 
GSGs are not possible with increased 
D, and GSG elevation cannot occur 
under decreased D) and are consistent 
with the predictions which have 
usually been made from the energizing 
account. 

A number of variables must be 
considered in determining the bound- 
ary conditions which make specific 
predictions possible. The following 
are some implications of the energizing 
account for GSGs, presented in the 


context of the relevant boundary 
conditions: 


1. One of the more unusual im- 
plications is that a GSG may be 
lowered by increased drive or raised 
by decreased drive. These inverse 
relationships between D and GSG 
elevations are possible only if trials 
can be ended solely by occurrence of 
an experimenter-designated dominant 
or competing response (as in Figure 4) 
rather than being time limited. Thus, 
if trials have a constant length and 
R, is dominant, according to the 
energizing accounts Pp, cannot be 
reduced by increased drive (see Broen 
& Storms, 1961, for further discus- 
sion). Thus, the use of time-limited 
trials may be desirable in experimental 
tests to reduce the ambiguity of the 
relevant predictions. 


2. The above-described threshold 
effects which may result in lower Pp, 
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under higher D are at a maximum 
when E; — L and E, — E» are small 
(see Spence, 1956, Figure 71). In 
such a situation, the proportion of the 
distribution of E, raised above thresh- 
old by D is at its greatest as compared 
to the counteracting effect of drive in 
increasing E; — L and Ej — Eo A 
testable consequence is that if in- 
creased D raises the GSG at any 
stimulus, further increases in D (act- 
ing in a situation with even greater 
E; — L and E, — E») will have a 
monotonically increasing relationship 
to the height of the GSG at that 
stimulus. This holds for both com- 
petitional and noncompetitional (see 
Figure 3) situations. The converse, 
that if lowered D lowers the GSG, 
further decreases in D will have a 
monotonically decreasing relationship 
to the height of the GSG, is neces. 
sarily true only for noncompe 
situations. 

3. Specific predictions about whether 
flattening or sharpening in gradients 


of response Probability will occur 
under increased or de 


very difficult to make foun 
gizing account. For example, in the 
situation illustrated jn Figure 
knowledge of E, — L and E, — By 
both Sz and Sa and amount m 
increase in D is needed to make suc 
predictions. Even if ingenious pri 
experimentation were done to obt: 
knowledge of the level of 5 Nes. 
experimental situation, additional / 
sumptions would be needed to b 
late most experimental drive Variation 
(e.g., a change in shock intensity) inte 
a specific change in the theoretical 
construct D. This kind of specific 
knowledge is needed to make predic- 
tions because of the nonmonotonic 
relationships between D and slope of 
GSG which are implied b 


gizing account. 


titional 


y the ener- 
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P Drive Stimuli and GSGs 


Drive-stimulus (Sp) effects have 
been omitted in the above discussion 
of energizing accounts of drive and 
GSGs. However, both Hull (1952) 
and Spence (1956) have mentioned 
the importance of Sp contributions to 
H and E. Therefore, we will com- 
:ment briefly regarding effects of Sp 
on GSGs and note some implications 
and problems. -Rather than include 
competing responses specifically asso- 
ciated with different intensities of Sp, 
we shall limit our discussion to the 
effects of Sp generalization. 

When training is carried out at one 

F drive level and testing at another, the 

stimulus situation at the time of 

| testing is less similar than it would be 

| if training and testing drives were the 

same. Thus, the H associated with 

: Sp would be reduced, with a corre- 
sponding reduction in E. - 

\ Possibly because of difficulties in 

“> the theoretical formulation and em- 

À pirical measurement of Sp or his 

| greater interest in energizing effects, 

‘Spence has generally controlled for 

Sp effects or omitted them from 

sideration (Spence, 1958; Spence, 

sdrich, & Ross, 1959). Although 

incerned with difficulties in his 

Tmulations, as we shall see, Hull 

952) provides some specification of 

^` effects in his postulate system: 


— RR 


ch drive condition (Cp) generates a char- 

teristic drive stimulus (Sp) which is a 
“©notonic increasing function of this state 
Lb. 7 postulate V]. 
Wh it i in association É 
ss Pal dr (52) and its strength is 
Sted under a different drive intensity, there 
Vill result a falling gradient of sHg and sir 
Cp. 11, Corollary xii]. 
t that sÉg will fall off 
hange in drive will 
GSG as compared 

Perhaps Hull 
he contribution 


with a 


This statemen 
implies that any € 
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of Sp to E would be lowered, so that 
the corollary would apply only to 
situations where, as the result of 
learning, drive stimuli have come to 
have primary control over the re- 
sponse. If other than drive stimuli 
contribute substantially to H, the 
energizing effect of D could outweigh 
the drive-stimulus decrement, leading 
to increased E and higher GSGs, as 
Hull (1952) predicts in Theorem 59 
(p. 226). 

There are other problems in the 
application of Hull’s principles con- 
cerning Sp: 

1. Since Sp vary in intensity, Sp 
generalization follows the principles 
of stimulus-intensity generalization 
(Hull, 1952, p. 11). However, Hull 
(1952, p. 89) cites evidence which 
he says implies a defect in his analysis 
of stimulus-intensity generalization, 
and therefore has substituted the 
methods used for qualitative-general- 
ization gradients rather than apply 
his methods for computing stimulus- 
intensity generalization gradients (p. 
220). 

2. Since Hull states only that Sp is 
a monotonically increasing function 
of Cp, it is impossible to compute the 
amount of Sp change associated with 
a given change in D. . 

3. The energizing effect of stimulus 
intensity, V, is included in the 
formulas for stimulus-intensity gen- 
eralization. In derivations regarding 
Sp generalization, are, V and D to be 
considered separately in the computa- 
tions, or should V be omitted since 
the same conditions, Cp, give rise to 
both? If one shares Hull's lack of 
confidence in his stimulus-intensity 
generalization analysis, this question 
may be unimportant, except insofar as 
answers contribute to more adequate 
formulations. 

As we have seen, almost any effect 
is consistent with the energizing 
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account, given the proper conditions. 
The introduction of substantial Sp 
effects leads to modifications in the 
predictions and even greater .com- 
plexity in the boundary conditions 
which have to be considered. Since a 
clear theoretical statement regarding 
Sp effects is lacking and the evidence 
concerning Sp changes and GSGs is 
sparse, we shall avoid compounding 
the complexity of the predictions and 
go on to consider one clear implication 
of Hull’s account of the effects of Sp 
changes. 
Since the same drive level is present 
at all stimuli on the stimulus con- 
tinuum, when testing drive is different 
from training drive, the same E is 
withdrawn at each point on the 
stimulus continuum. Employing the 
formula for the withdrawal of E (Hull, 
1952, p. 9), we find that removing a 
given amount of E will reduce the 
total E by a larger amount when the 
initial E is small (as at generalization 
stimuli) than when the initial E is 
larger (as at the training stimulus). 
Let E; represent the E at the training 
stimulus, E» at a generalization stim- 
ulus, and E; the amount to be with- 
drawn because of a drive-stimulus 
change, while M is the maximum pos- 
sible E. The difference in the changes 
in E, and Fy, (E\— Ey’) — (Es— E), is 


(E, — Ey) ( "x 


Since this value is always negative, the 
change in the smaller E (E) is greater 
than the change in the larger E (Ei). 
Hence, in terms of E, the resulting 
GSG is steeper. 

Thus, ignoring the energizing effects 
of drive, Sp generalization invariably 
leads to lower and steeper GSGs in 
terms of E. (In terms of response 
probabilities, some flattening would 
be introduced if the E, distribution 
moves below threshold.) This is the 


one invariant prediction regarding D 
and SG which follows from Hull's 
theory. It holds whether Sp general- 
ization follows stimulus-intensity or 
qualitative-generalization principles. 
It is also independent of which posi- 
tions are taken regarding the other 
problems in Hull's Sp formulation. 
However, these problems affect the 
manner in which S; effects are com- 


bined with energizing effects to modify 
the predictions of the 


energizing 
accounts. 


Comparison OF THE THEORIES 


An important feature of D XH 
energizing accounts of drive and SG 
is that they sometimes predict non- 
monotonic relationships, for example, 
sharpening of GSGs followed by 
flattening with increased D. Thus 
specification of amount of D is neces. 


Prediction for a 
Situation, A]. 


Additional difficulties are involved 
in making Predictions which depend 
upon both an energizing effect and a 
drive-stimulus change effect of drive 
or upon changes in sampling proba. 
bilities and drive-stimulus elements, 
Shared by the theories under consid- 
eration, these difficulties have to do 
with the fact that differences in drive 
stimuli must be inferred from the very, 
drive-inducing conditions and con- 
sequent changes in dependent vari- 
ables which are used as evidence ofa 
change in drive. Predictions often 
depend upon the relative predomi- 
nance of energizing and drive-stimulus 
effects or sampling probability and 
drive-stimulus change effects, while 
the determination of this predomi- 
nance in an experimental situation is 
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at least extremely difficult. Deter- 
mination of drive-stimulus intensities 
as a function of D and the psycho- 
physical scaling of drive intensities 
would be required for the application 
of Hull's formulas, and similar speci- 
fication would be required for other 
D X H accounts. Similarly, the num- 
ber of drive-stimulus elements and the 
change in the elements available for 
sampling for a given drive change are 
needed to make specific predictions, 
but are not specified in Estes' theory 
and cannot be derived from empirical 
determinations, 

Although most predictions are de- 
pendent upon specification of amount 
of initial drive, change in drive, and 
Change in drive-stimulus intensity or 
drive-stimulus elements, some pre- 
dictions from Estes are more general 
than any from the D X H accounts. 
Estes" theory leads to at least two 
unequivocal predictions, The first is 
that increased drive will lead to flatter 
GSGs while decreased drive will bring 
about ,Steeper GSGs. Since this 
prediction is independent of experi- 
mental arrangements, Estes' theory 
Is more testable than the others in this 
respect, and testability is certainly a 
desirable feature of a theory. Another 
unequivocal prediction from Estes is 
that a response will have some non- 
zero probability of occurrence even at 
extremely remote generalization stim- 
uli (because some drive stimuli are 
conditioned to the response). If any 
stimulus situation whatever can be 
found such that p = 0, this prediction 
is disconfirmed. Competing responses 
and the threshold allow Hull and 
Spence to predict p = 9 jy some 
circumstances. A recent revision of 
Estes’ theory (19592) stating that 
responses are conditioned to patterns 
of stimuli rather than to individual 
stimulus elements circumvents this 


difficulty, but at the expense of greatly 


increased complexity and decreased 
testability. 

As we have seen, the combination 
of threshold effects with response com- 
petition in D X H accounts leads to 
very complex and nonmonotonic rela- 
tionships between D and SG. These 
relationships are not easily compared 
with derivations from Estes. How- 
ever, clearer predictions not requiring 
quantitative specification of variables 
can be made for classes of situations 
in which Æ; is high enough to be above 
threshold or where there is no response 
competition. These predictions can 
be compared with the corresponding 
predictions from Estes. 

The predictions from Hull and 
Spence differ from those derived from 
Estes most strikingly with regard to 
slope, where Estes' predictions are 
clear-cut, as we have seen, and those 
from Hull and Spence are ambiguous. 
However, if energizing effects pre- 
dominate, with A; suprathreshold and 
Pr, not initially close to 100%, the 
prediction from Hull or Spence is that 
decreased drive will lead to flatter 
GSGs, which is more consistent with 
the meager evidence (Dinsmoor, 1952; 
Newman, 1955; Thomas & King, 
1959) than Estes' unambiguous pre- 
diction of steeper GSGs with lowered 
drive. With higher testing than 
training drive, some evidence shows 
steepening (Dinsmoor, 1952; New- 
man, 1955) and some shows flattening 
(Broen et al., 1963). In the Jenkins 
et al. (1958) experiment, which con- 
trolled for drive-stimulus changes by 
training and testing at the same drive, 
higher drive was associated with 
flatter GSGs, as predicted by Estes. 
However, in a portion of the experi- 
ment by Thomas and King (1959) 
which controlled for drive-stimulus 
changes in the same manner, higher 
drive was associated with steeper 
GSGs, and for drive levels higher than 
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MECHANISM OF THE FIGURAL AFTEREFFECTS 
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University of California, Riverside 


i d that the figural aftereffects are actually a species of 
Totes Misa in which the afterimage of the inspection object 
acts as an inducing figure. The afterimage displaces the test contour 
in phenomenological space because the inducing contour exerts inhibi- 
tion on the test contour. A simple mathematical theory is described 
which accounts for the spatial distribution of displacements. 


When contours of visual objects are 

close together, they often induce shifts 
in their respective apparent locations. 
Since the objects are not, in fact, 
changed in their positions in the visual 
field, it must be the neural correlates 
of these edges or lines which somehow 
interact, shifting each other’s positions 
in their topographic projection in the 
visual system. The phenomenological 
result is a deviation in perception from 
veridicality, an illusion. In Figure 1 
the lines of the square on the left tra- 
versing the concentric circles are so 
affected by each intersecting arc as to 
increase slightly the angle of each in- 
tersection, especially if the angle of 
intersection is somewhere midway be- 
tween zero and 90°. This gives the 
square a slightly concave appearance. 
Many illusions—such as those of Her- 
ing, Wundt, Müller-Lyer, Zöllner, and 
Poggendorff—seem to involve such a 
common principle. 

There is another class of con- 
tour shifts—the figural aftereffects—in 
which the inducing contour! seems to 


1Using commonly accepted. terminology, 
we call the inducing figure a line, edge, ob- 
ject, etc., introduced as an independent vari- 
able into O's sensory field, visual or other. 
The test figure is a dependent variable, a 
type of probe stimulus, upon which the ef- 
fect is exerted. In this paper we will be 
concerned mostly with a dimming or dis- 
Dlacement in appearance. The comparison 
figure is any figure which is placed far 
enough away from the inducing figure so as 
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act proactively. The experimenter (E) 
presents an inducing figure in the 
observer’s (O’s) visual field, then re- 
moves it, and then presents a test 
figure. Even with this temporal sepa- 
ration between the two figures, the 
first still continues to act on the second 
so as to shift the second’s apparent 
position. 

Figure 2 illustrates an experiment 
involving such a procedure (Köhler & 
Wallach, 1944). O is first shown the 
two black rectangles (marked I in the 
figure), which are the inducing figures. 
He fixates the X mark. Then, usually 
after about 60 seconds of inspection, 
these rectangles are removed. Now 
the two outline squares (each a test 
figure) are shown, and O again fixates 
the X mark. Frequently O will report 
that the square on the left looks higher 
than that on the right (in fact they 
are equally high). This is a typical 
displacement effect; contours are re- 
pulsed away from the edge of the pre- 
viously shown inspection figure. The 
displacement effect is typically maxi- 
mal not when the inducing and test 
lines abut, but when they are actually 


to be only mildly affected by the independent 
variable. Thus, any change in appearance 
or position of the test figure is gauged by 
contrasting it with the less affected compari- 
son figure. The interfigural distance (S) 
is always the distance between the inducing 
and the test figure (in minutes of visual angle 
unless otherwise noted). 


mc 
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du nor LUE geometris illusions, (In the left figure, the lines of the square traversing 

lo ihe vig E Er " are seen by many observers to turn toward the center of the circles. 

converging tee s ir square is seen as a trapezoid, the side closer to the intersection of the 

pae i 5 ooking larger. Note that in both cases the phenomenon involves a growth 
he angles formed by lines intersecting at 10°-40°.) 


separated. Köhler and Wallach have 
called this the paradoxical distance 
effect. 

In the experiment illustrated in Fig- 
ure 3, we first present O with a circle 
as an inducing figure, and O fixates 
the X mark on the right edge of the 


I 


I 


Fic. 2. A stimulus i c 
Köhler and Wallach tiene ie crak ame 
aftereffects. (O fixates the X iar aud 
the I figure is presented. Then the I figure 
is removed, and the T figures are presented. 
The T figures are then seen more dimly, and 
repelled away from the former locus of the 
I figures. The repulsion is reported by 
many Os when I and T are presented 
together.) 


circle. This figure is then removed, 
and an outline rectangle, T, is pre- 
sented, with the same fixation mark, X, 
as shown. O reports that the left part 
of the rectangle looks as though it has 
shrunk (a size effect, often equivalent 
to the displacement effect and again 
involving a repulsion away from the 
inspection-figure contour). Subjects 
also report that the test contours left 
of the fixation mark (inside the former 
inspection circle) look dimmer, or 
washed out, with less contrast (the so- 
called color effect). Lastly, the left 
half of the rectangle is often seen fur- 
ther back in space (the depth effect). 


I 
Fic. 3. Same as in Figure 2. 
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It is not necessary to present the 
inducing and test figures in succession 
to see the displacement effects. Many 
Os looking at Figures 2 or 3 from 
about 3 meters distance, with the in- 
ducing and test figures shown simul- 
taneously, still report the displacement 
effects we have described. This simi- 
larity in results between the simul- 
taneous and successive presentations 
was already apparent to Kóhler and 
Wallach in their earlier experiments 

and has recently been reiterated (Lo- 

gan, 1960). Do the two sets of dis- 
placement effects have identical mecha- 
nisms? This constitutes the point of 
departure of the present theory. 

The initial theory of the “Figural 

Aíter-Effects" (Köhler & Wallach, 

1944) used simple physical principles 


involving current flow in a volume 
conductor such as the visual cortex to 
derive the distortions in perceptual ap- 
pearance. The theory attracted con- 
siderable attention, some of it critical 
(Day, Pollack, & Seagrim, 1959; 
Hebb, 1949; McEwen, 1958; Spitz, 
1958). The main criticisms have been 
that: 


1. Lateral direct-current flow can be 
disrupted without noticeable changes in 
perceptual function (Lashley, Chow, & 
Semmes, 1951; Sperry & Miner, 1955; 
Sperry, Miner, & Myers, 1955), 

2. 'The theory assumes that a topo- 
logically faithful "picture" is trans- 
mitted to the cortex (Hebb, 1949), but 
in fact each synaptic stage is the site 
of radical information processing (Hu- 
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Fic. 4. The magnitude of the displacement effect when the distance between the inducing 
and test figure is varied. (The inducing figure is a white square and the test figure, a white 
vertical line. They are presented together, as depicted in the insert, in the manner of a 
simultaneous illusion. The curves on the left represent two successive series of measurements 


with I and T figures presented together, N = 60. 


The rightward curve, Pollack, 1958, 


represents data, adjusted in height, from a successive presentation: I and then T.) 
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(Positive values represent a loss in sensitivity: The arrowheads 
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bel & Wiesel, 1959, 1962; Kuffler, 
1953; Lettvin, Maturana, McCulloch, 
& Pitts, 1959), 

3. The theory is basically a fatigue 
theory, yet the effects build up almost 
instantaneously (Ikeda &  Obonai, 
1953 ; Kéhler & Wallach, 1944, p. 355; 
pama, 1953, 1956). In fact, as we 

ave seen, one often obtains quite com- 
parable results by presenting the induc- 
ing and test figures simultaneously. In 
a recent experiment, 60 subjects were 
shown a simultaneous presentation of 
the inducing and test figures originally 
used by Köhler and Wallach. The 
contour repulsions obtained are given 
in Figure 4. A comparison with an 
experiment in which the figures were 
presented successively (Pollack, 1958) 
suggests a close correspondence of the 
two sets of results (see also Ikeda & 


Obonai, 1955; Ikuta, 1956; Logan, 
1960; Nozawa, 1953). 

4. Displacements sometimes occur 
toward satiated areas, as in the experi- 
mental results shown in Figure 5 (see 
also Ganz, 1964; Ganz & Day, in 
press; Nozawa, 1953a, 1953b; Smith, 
1954). 

5. Independent indexes of the hypo- 
thetical construct "satiation"—for ex- 
ample the color effects and the dis- 
placement effects—can be shown to 
be independent (Ganz & Day, in 
press). Itis likely that they can occur 
at different neural loci. 

The Osgood-Heyer (1952) theory 
is also a fatigue theory and so cannot 
be reconciled with Criticisms 2-5 
above. As a hypothetical construct, it 
has not accorded well with some facts 
of visual physiology (Deutsch, 1956). 
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A New THEORY oF FIGURAL 
AFTEREFFECTS 


'To provide a satisfactory mechanism 
of the figural aftereffects, two phe- 
nomena must be — for vos 

ndently: (a) How do contours dis- 
cien "e EE and (b) How do 
contours interact when they are tem- 
porally separated? We will first pro- 
vide a somewhat general and qualitative 

description of a theory. This will be 
followed by a more detailed exposition. 

It appears that most of the difficulties 
inherent in the earlier theories evolved 
from their attempt to deal with both the 
displacement effect and the time-bind- 
ing (proaction) effect with a single 
mechanism. Since the two effects dis- 
play a rich variety of independent 
behavior, there is no recourse but to 
consider two separate mechanisms : The 
two chosen in the present study are 
lateral inhibition (7) and light adapta- 
tion. We want to explore the possi- 
bility that all contour displacements 
which are induced by neighboring fig- 
ures and are restricted to about 10 or 

20 minutes of arc in human perception 

are due to the operation of such inhibi- 

tion. This is considered to apply to 
geometrical illusions (such as in Fig- 
ure 1) and to many figural aftereffects 

(such as Figure 2 or 3). In this, our 

theory owes something to the ideas of 

Day (1962) and Deutsch (1964) in 

which similar approaches were sug- 

gested. As we will show below, the 
inhibition typically diminishes as the 
distance between the contours is in- 
creased. This descending function is 
actually steepest at very small distances, 
just a few minutes of arc. It is easy 
to understand that when a contour is 
being inhibited by such a steep func- 
tion, the portion of the contour near 
the inhibiting figure will be inhibited 
more than the portion at a greater dis- 
tance. This must shift the mean of the 
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contour’s distribution away from the 
inhibiting figure. This is the repulsion 
effect. 

Now such a theory, by itself, would 
lead one to think that the greatest 
repulsions occur when the two con- 
tours are least separated, which is when 
they touch. Yet the distance paradox 
tells us that empirically it has been 
found that repulsions are small or nil 
when the two contours touch, There 
are at least two plausible reasons why 
this should be so. One is that at very 
small interfigural distances, the two 
contours summate their excitations, In 
such a situation, it is likely that O 
actually confuses the inducing-figure 
and the test-figure contours (Taylor, 
1962). Secondly, it is known that 
eye movements are present during the 
presentation of the test figure, Such 
movements must sometimes cause the 
test figure to be placed on one side of 
the inducing figure’s edge and some- 
times on the other, This is increas- 
ingly the case as the interfigural separa- 
tion is made smaller. The displace- 
ment effect, a Tepulsion, is Opposite in 
direction on the two Sides of the induc- 
ing figure. Thus, when such cross- 
Overs are frequent, we will be adding 
judgments made now with one dis- 
placement, now with an Opposite one, 
with the net result close to zero. Thus, 
net displacement (which is what the 
grouped experimental results give us) 
will approach zero as interfigural dis- 
tance approaches zero, Although we 
are in general agreement with Taylor 
that contour confusion must play a 
part, the evidence below suggests that 
this is not sufficient. The present 
model explores the second alternative. 

We next consider the temporal prob- 
lem: How does the inducing figure act 
Proactively, crossing a time gap to 
displace the test figure? When O 
views a dark figure, there is a corre- 
sponding retinal area which dark 
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adapts more than its surround; the 
portion of the retina of a subject where 
a bright figure is imaged will be a site 
of greater light adaptation in compari- 
son with the surround. When the in- 
ducing figure is then removed, O will 
view stimuli with a heterogeneously 
adapted retina: a retina with some 
regions that are more dark adapted and 
regions that are more light adapted 
than the remainder. The result is 
that under normal levels of illumina- 
tion, the subject will, in fact, continue 
to perceive the inducing figure. It will 
be Seen as a light-dark reversal of the 
original, and it will decay in time. 
But while it is present, its. edges will 
interact with the test figure as they 
do in simultaneous illusions. This is 
because the stimuli which are created 
by a heterogeneously light-adapted 
retina are stimuli which in many re- 
spects are identical, from a neural point 
of View, to those created by a distal 
stimulus onto a homogeneously adapted 
retina. In many situations the subject 
cannot actually discriminate between 
the two inducing figures, an ordinary 
stimulus originating in the distal en- 
vironment and an afterimage (Barlow 
& Sparrock, 1964; Brindley, 1963). 
We suppose that the contours interact, 
even when interocularly produced, at 
the visual cortex to produce simultane- 
ous illusions. The afterimage exists 
beyond the preservation of the distal 
figure and decays in time. This is the 
mechanism which bridges the time gap 
in the figural aftereffect situation; or, 
more properly, the processes of light 
and dark adaptation bridge that time 
gap. 

Below we first examine the evidence 
on lateral inhibition, and we then pre- 
sent the theory in a quantitative form. 
We next examine the spatial distribu- 
tion of displacement effects as they fol- 
low from the theory. In a subsequent 
Paper we will compare the growth and 


decay of light and dark adaptation on 
the one hand and the growth and decay 
of figural aftereffects on the other. 
The similarities in temporal properties, 
which are quite striking, will be used to 
substantiate our argument. 


Lateral Inhibition 


Psychophysical Evidence. Because 
of diffraction at the edge of the pupil 
and because light is scattered within 
the eye (Bartley, 1951, pp. 924-926; 
Boynton & Riggs, 1951), slit-shaped 
stimuli of high contrast are projected 
on the retina as broad, diffuse distribu- 
tions (Flamant, 1955) considerably de- 
graded in contrast. The anatomy and 
physiology of the retina suggest con- 
siderable spatial integration, even in 
the fovea, extending minimally to many 
minutes of arc and maximally (as in 
the periphery) to 1 or 2 degrees of arc, 
perhaps more. These facts might seem 
difficult to reconcile with the finding 
that in many vertebrate species visual 
resolution (the smallest consistently de- 
tected gap between two thin bars) is of 
the order of a single minute of arc. 
In fact, it is well known that the effects 
of spatial integration are held in check 
by a counterforce: lateral inhibition. 
Psychophysically, lateral inhibition is 
manifested by the appearance of con- 
trast: Black solid figures appear black- 
est at their edges with white; white 
surfaces appear brightest at their edges 
with dark figures. Phenomenological 
contrast was known to Aristotle, was 
utilized technically by Leonardo. da 
Vinci to enhance light values (Boring, 
1942), and was first examined quanti- 
tatively by Lehmann (1886) and Hess 


and Pretori (1894). 

Of the aspects of simultaneous con- 
trast which are of interest to us, it 
should be noted that brighter stimuli 
inhibit dim ones more readily the 
greater the brightness difference of the 
two patches. This can be seen in Fig- 
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Fic. 6. The test-field luminance needed to match a 
the inducing field, which surrounds the test field, is 


action of the test field. The test field mu 
with the comparison field. A, B, C, etc. 
field—Heinemann, 1955.) 


ure 6, taken from Heinemann (1955). 
In the figure the more intense inducing 
Stimulus produces more inhibition. 
And for a range of values, it appears 
that the log matching luminance is a 
linear function of the log inducing 
luminance. One study (Leibowitz, 
Mote, & Thurlow, 1953) reports the 
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Interfigural Distance (mins. of arc) 
Fic. 7. The amount of inhibition exerted 
by a figure on another When their separation 
is varied. (The various functions depict 
effects of inducing figures of different in- 
tensities. An ordinate value of 1.00 log mL 
represents a 10-fold dimming of the test 
figure; zero mL represents no effect. 

Adapted from Leibowitz et al, 1953.) 
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constant comparison field. (When 
made more intense, it inhibits the 


to obtain a match 
of the comparison 


dimming of the test field by the induc- 


ing field to be Proportional to the log 
inducing field luminance, 


Lateral inhibition js quite sensitive 
to the distance between two stimuli, 
For example, Beitel (1936) found that 
the empirical curve relating the amount 
of inhibition of the inducing stimulus 


on the test stimulus, as a function of 
luminance, was Steeper when the two 
stimuli were 


adjoined more closely. 
In Figure 7, adapted from the data of 
Leibowitz et 


1 al. (1953), we see that 
inhibition diminishes when the distance 


between two stimuli is increased, 
Most of the inhibition occurs within 
the initial 10 minutes of arc, whether 
the inducing stimulus js strong or 
weak. Thus, when the inducing stim- 
ulus is more intense, the distance func- 
tion is steeper. This property will be 
utilized in the theory below. 

Contour Inhibition, Thus far we 
have described only inhibition that is a 
function of luminance levels. There is, 
however, an inhibitory effect which is 
more properly described as that of one 
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DIFF. THRESHOLD 


BRIGHTNESS OF C 


: Fic. 8. By either increasing or decreasing the luminance of Annulus C, we can introduce 
contour at the B-C junction. (This appears to interfere with the differential brightness 


threshold at the A-B junction. 


The lowest threshold is obtained when the B and C 


CS es are equal, that is, when there is no contour between B and C. The results are 
shown in the figure on the right—from Dittmers, 1920.) 


contour edge on another, and this ef- 
fect will be of particular interest to us. 
It has been observed that the resolution 
of two bright parallel bars is improved 
by increasing their intensity (Fry & 
Cobb, 1935). For wide bars (33 min- 
utes of arc) acuity was continuously in- 
creased at higher luminance levels. 
However, with thinner bars (3 minutes 
of arc) acuity sharply declined with 
luminance levels above about .5 log 
trolands. The interprepretation was 
made that acuity declines in the case 
of the thin bars, at higher luminance 
levels, because the two edges of each 
thin bar, 3 minutes of arc apart, were 
inhibiting one another. The effective- 
ness of this variable was confirmed by 
Fry and Bartley (1935) using a test 
Situation depicted in Figure 8. The 
threshold of a difference in brightness 
between areas B and A is determined. 
The introduction of a dark ring at the 
outer border of B with C and concen- 
tric with A is found to raise that thresh- 
old. The threshold rise is greater the 
closer the dark ting is to the AB 
border. It appears that the B.C edge 
is exerting an inhibition on the A-B 
edge, thus making its detection more 
difficult. A very similar type of con- 
tour inhibition seems to play a pre- 
dominant role in one type of metacon- 
trast (Alpern, 1952; Kolers & Rosner, 


1960; Werner, 1935) where one edge 
interferes with an adjoining one pre- 
sented somewhat earlier in time (us- 
ually on the order of 100 milliseconds). 
Metacontrast contour interference is 
more powerful when the two contours 
are closer (Kolers & Rosner, 1960). 

There is an additional difference be- 
tween simultaneous contrast and con- 
tour inhibition which is pertinent. 
Brightness contrast is an asymmetrical 
phenomenon: Bright areas inhibit dim- 
mer ones, but dim areas have little ef- 
fect on brighter ones (Graham & 
Granit, 1931; Heinemann, 1955). 
Contour inhibition, however, is much 
more symmetrical: A white edge on a 
gray background exerts about as much 
inhibition on the detection of a dark 
line as does a black edge on a gray 
background. In an experiment by 
Dittmers (1920) a stimulus situation 
very similar to that in Figure 8 was 
used. The effect of varying the bright- 
ness of C upon the differential thresh- 
old for A was investigated. The 
results are also shown in Figure 8 
(right). The threshold is lowest when 
the luminance of B and C are equal, 
that is, when no edge is present be- 
tween B and C. Making C either 
brighter or darker interferes with the 
establishment of an edge contour be- 
tween A and B. This must mean that 
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it is the luminance discontinuity per se 
at the B-C edge which interferes, partly 
independent of the direction of the 
gradient. Fry and Bartley (1935) 
have obtained the same symmetry with 
a basically similar stimulus. ^ Since 
figural aftereffects also show such a 
symmetry—approximately equal dis- 
placement effects are obtained with 
white-on-black or black-on-white in- 
spection figures—inhibition must have 
a similar symmetry if it is to be in- 
voked as the mechanism of displace- 
ment. Simpler simultaneous contrast 
does not have this symmetry : White in- 
hibits dim regions, but darker areas do 
not inhibit the brighter ones (Heine- 
mann, 1955). For example, Graham 
and Granit (1931) found that two ad- 
joining semidiscs of unequal bright- 
ness affected each other's critical 
flicker fusion (CFF) asymmetrically. 
The CFF of the less bright areas di- 
minished slightly (about 1.5%) while 
that of the brighter area increased 
(about 5.4%). 

Physiological Evidence. The physi- 
ological analysis of inhibition has paral- 
leled the psychophysical data so clearly, 
although it trails the latter in precision, 
that it sems worthwhile to present some 
of that evidence. For one of the ad- 
vantages of an inhibitory model of 
contour displacement is precisely that 
the mechanism is well documented 
physiologically, and in some detail. 
The physiological analysis of inhibitory 
mechanisms in the visual system begins 
with the work of Hartline (1940) who 

measured the receptive fields of single 
optic fibers of the frog, and therefore 
of single ganglion cells, using very 
small spots of light. Small areas of 
stimulation were shown to follow 
Ricco's law of areal summation. But 
increasing the area beyond a certain 
point actually increased the threshold. 
This was one of the earliest neurophysi- 
ological indications of the operation of 


a visual inhibitory mechanism. Kuf- 
fler (1953) demonstrated the existence 
of an antagonistic (inhibitory) annulus 
surrounding the receptive field in the 
retina of the cat. A small spot of light 
in the center of a receptive field might 
give an on response, but if the spot was 
moved to the periphery, the same cell 
showed an off response. The two 
spots together gave a much weaker re- 
sponse, the stronger spot luminance 
determining whether the response 
would take on the characteristic of the 
center or the periphery. Thus, a true 
antagonistic mechanism was at play. 
Barlow (1953) has shown, on the frog's 
retina, a similar mechanism at work. 
First, the response of a spot of light 
is measured when placed in the center 
of the receptive field. Next, a second 
spot is introduced which, although it 
registers no response itself, inhibits the 


response to the centrally placed spot. 


tibitory action is 
intensity of the 
ause we are interested 


; active inhibitory effects of a 
stimulus, it is interesting to cite one of 


Kuffler’s (1953) observations. He 
noted that the suppression of an off 
response, by stimulation of an on area 
of a receptive field, could be seen for 
some time after the on area’s stimulus 
had been turned off. “It was found 
that beam (A) to an on area could 
suppress (B) stimulation of an off 
area for varying periods after the (A) 
had been turned off. The time course 
of the inhibitory after effect depended 
on the duration and the intensity of 
(A) [p. 53; italics mine]. Con- 
versely, a ganglion cell which gave 
an off effect was often found not to re- 
spond to a small Spot of light directed 
to the on area during the period of 
the off discharge. Thus, the antagonis- 
in action of a receptive field has been 


ound to outlive the duration of the 
stimulus, 


Furthermore, this inj; 
proportional to the 
second spot. Bec: 
in the proactive 


p 
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Right eye 


Left eye 


at V uk ne vertical lines are action potentials of a single neuron in the visual cortex of 
es ae spur white bar when it first enters the top of the receptive field gives an 
eue d hen an off response at the center of the field, and an on response again at the 
eee receptive field. This can be obtained from stimulation of either or both eyes, 
ios ME retinal points are used. When both eyes are stimulated together, Record 3 
es ation, an even stronger response is obtained. In Records A and B, the on 

is seen to be activated by either eye and to summate binocularly ; in C, this is 


shown to apply to the off response.) 


Hubel and Wiesel (1959, 1962) have 
extended this analysis, examining the 
receptive fields of cortical neurons. 
These receptive fields take on compli- 
cated shapes and are frequently elon- 
gated. Whatever the shape, the cor- 
tical neuron gives its strongest response 
(for so-called "simple" fields) only 
when numerous light spots are used to 
summate the activity within the field. 
(For example, a narrow, slit-shaped 
Stimulus oriented vertically within a 
vertical receptive field gives a maximal 
response.) As with retinal cells, and 
also lateral geniculate cells, these cor- 
tical cells have receptive fields with 
flanks of an inhibitory character. But 
what makes the response of cortical 
cells distinct is that line stimuli E 
edges are the more powerful stimula- 
tors, and closely adjoining or parallel 
lines and edges are the more powerful 
inhibitors. Diffuse light gives a mini- 
mal response. This clearly appears to 


provide a mechanism for edge inhibi- 
tion per se, such as has been described 
above. 

Binocular summation is another dis- 
tinctive property of the cortical cells. 
Summation occurs between similarly 
oriented stimuli, such as two hori- 
zontal lines presented to corresponding 
retinal areas of the two eyes; inhibition 
occurs when a stimulus is presented to 
the inhibitory flank of the other eye. 
This is shown in Figure 9, taken from 
Hubel and Wiesel (1959). The 
mutual suppression of closely adjoin- 
ing parallel contours has long been 
known in psychological literature. 
(For the binocular suppression of con- 
tours, sce Asher, 1953; Day, 1961.) 
But this is the first evidence we have 
ofa neurophysiological mechanism. It 
provides a clear physiological basis for 
a theory in which a contour process 
originating in one eye and a second 
contour process originating in the other 
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eye are both projected cortically and 
inhibit each other at the cortical site. 
Hartline, Ratliff, and Wagner 
(1956) and Ratliff and , Hartline 
(1959) have made a quantitative analy- 
sis of lateral inhibition among single 
ommatidia of the limulus eye. Al- 
though there are many apparent dif- 
ferences between arthropod and verte- 
brate visual systems, contrast-enhancing 
mechanisms seem to be present in both 
(including the Mach bands), and this 
suggests that we can gain something 
from a comparison of the two Systems, 
proceeding with due caution. 

In the limulus eye, interaction 
among ommatidia is inhibitory; there 
is no neural spatial summation. If we 
measure the response of a single omma- 
tidium and then illuminate neighboring 
ommatidia, the threshold of the recep- 
tor unit being measured is raised, the 
number of impulses it discharges in re- 
Sponse to a suprathreshold flash is 
diminished, and the frequency with 
which it discharges impulses during 
steady illumination is reduced. The 
following factors affect the magnitude 
of inhibition: 

1. Intensity—the more intense the 
inhibitory illumination, the deeper and 
longer is the initial depression in fre- 
quency of discharge. The steady level 
is linearly related to log intensity of 
inhibition. 

2. Area—the larger the size of the 
neighboring area that is illuminated, 
the greater the slowing down in rate. 

3. Distance—the effectiveness of the 
inhibition diminishes with distance. If 
inducing and test stimulus are further 
apart on the surface of the compound 
eye of the limulus, then the slope relat- 
ing the intensity of the inducing figure 
to the amount of inhibition of the test 
figure is flattened. This also must 

mean that the slope of inhibition (J) 
as a function of distance between I and 
T is steeper when J is more intense, 


The effect of induction and test is 
mutual. The inducing figure’s inhibi- 
tion of the test figure depends on the 
illumination of the test figure, since the 
test figure in turn inhibits the inducing 
figure. Thus, the rate of discharge, 
Tp in the receptor “p,” set among a 
population of n interacting receptors, is 
given by the set of n simultaneous 
equations of which one is: 


n 
To = 6p — Y, Ry; (rz — Y^pj) 
j=1 


1<S<ż<n, [1] 


where ep is the discharge from receptor 
p when it is illuminated alone, r; is the 
discharge from a neighboring receptor, 
j G = 1, 2, etc.), after interaction with 
all the other n receptors in its neigh- 
borhood, kp; is the coefficient of inhibi- 

[ eceptor p, and 
7^» is the threshold amount of dis- 
charge in j just needed to inhibit T. 


1 found to fit the 


our purposes the following properties 
of the equation should be noted, Tn- 
hibition is a quantity whi 
subtracted from single-un 
each receptor after being itself inhibited 


(Hartline, 1940). 
crease, it is empirically found that the 
proportionality constant kp; is smaller, 
Thus, with a smaller distance, the func- 
tion of inhibition and intensity has a 
steeper slope. Also, at high intensity, 
inhibition as 4 function of distance is 
steeper. This, in turn, must mean that 
inhibition as a function of distance has 
a negative slope and, moreover, is not 


—— 
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linear, but is exponential or possibly a 
polynomial of higher order. 

To summarize the neurophysiological 
contribution, separate inhibitory mech- 
anisms have been discovered which 
seem to underlie both brightness con- 
trast and contour inhibition (including 
binocular effects). ^ Both types of 
inhibition diminish with distance. In- 
hibition is proportional to the log in- 
tensity of the inducing stimulus, and 
the spatial function is steeper for more 
intense inducing stimuli. Inhibition 
can sometimes operate as an aftereffect. 
And, at least in limulus, the effect is 
additive. A certain amount of dis- 
charge frequency, a result of neighbor- 
ing activity, is subtracted, and the re- 
sultant is expressed by a simple sum of 
simultaneous equations. 


An Inhibitory Mechanism of Contour 
Displacement 


We suppose first that the neural cor- 
relate of a contour can be represented 
by a ridge of activity with a cross 
section which, for illustration, is de- 


picted by T in Figure 10. We will 
assume that the mean of the distribu- 
tion will determine the apparent posi- 
tion of the contour as it is measured in 
an alignment task. The variance of 
the distribution will determine part of 
the variance of the alignment judg- 
ments. We suppose further that I, the 
inducing figure, exerts inhibition. The 
effect of inhibition is to subtract exci- 
tation from the test figure’s distribu- 
tion. In other words, the apparent 
position of the test figure in the pres- 
ence of some other figure, I, is given 
simply by the net distribution. Net T 
is shown by the dashed line in Figure 
10. 

Köhler and Wallach (1944) found 
equivalent repulsion displacements oc- 
curred whether white-on-black or 
black-on-white inspection figures were 
used. Although attempts have been 
made to demonstrate that these are not 
equivalent (Ganz & Day, in press), 
the evidence is equivocal. Assuming, 
in the absence of clear evidence to the 
contrary, that they are equivalent, it 


pe 


" 
$ 


Fic. 10. 


I : T represents a theoretical spatial distribution of a cross section of neural 
excitatory ridge to which a test-figure contour gives rise in the visual system. 


(The edge 


of an inducing figure is placed by E at à distance, S. This I figure is thought to generate 
lateral inhibition equally on both sides, The inhibition diminishes rectilinearly with distance. 
The amount of inhibition is proportional to the log contrast of the inducing figure, log L. 
In this model, the inhibition is subtracted at each point from the T distribution to give a 


bution is shifted away from T.) 


. residual excitation, net T. Displacement, D, occurs because the mean of the net-T distri- 
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appears that the type of inhibition 
which is exerted by I on T which 
causes it to be displaced is the type re- 
stricted to the action of contours on 
contours, which has been described 
above. 

As a first approximation, it seems 
plausible (Leibowitz et al., 1953; Rat- 
liff & Hartline, 1959) to assume that 
for a white line on a grey surround 
the amount of inhibition is a linear 
function of the log luminance (log L) 
of the inhibiting contour. This con- 
tour inhibition appears to decay ex- 
ponentially over a distance of some 120 
minutes of arc, with the greatest drop 
occurring in the first few minutes of 
arc. For simplicity, we explore a 
linear model in which the inhibition 
simply drops linearly to zero at a point 
m away on both sides of the inhibiting 

edge. This is shown in Figure 10. If 

the middle of the test figure is at the 

center of a coordinate system and the 

inspection figure at —S on the abcissa, 

then it can be shown that the amount 
of inhibition, I, at a point, x, is given 
by: 


where L is the luminance of the induc- 
ing figure, S is the distance between 
the inducing and test figures, and m is 
one-half the width of the total region 
over which inhibition is exerted by the 
inducing figure. Log L/m gives the 
slope of the inhibitory function. 

First we would like to show that 
the actual shape of the test-figure dis- 
tribution is not a critical variable. In 
Figure 10 we illustrate the cross sec- 
tion of a straight contour, T, which 
has a width —ito +i. Let the function 
which relates this cross section to the 
variable x on the abcissa be f(x). In 
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a control test in which the inducing 
figure is not presented, the mean of the 
test-figure distribution is given by: 


z= L x f (x)dx. 


Then the net distribution will be: 


net i — L x[f(x) — I.]dx. 


Substituting Equation 2 into this ex- 
pression, integrating, and collecting 


terms, and letting b —log L/m, we 
get: 


netz = i — 3073. 


By definition, contour displacement, D, 
is given by the mean of the net distribu- 


tion less the mean of the control dis- 
tribution, hence: 


D = —&bi. [3] 


Since f(x) is not in the equation, it 
can be seen that the displacement. is 
independent of the particulars of the 
test figure’s distribution, although it is 
sensitive to the width of the test figure 
Moreover, the amount of displacement, 


D, is proportional to the slope of the 


inhibition function, b. And since b 


equals log L/m, we see that contour 
displacement is (a) proportional to the 
log of the inducing contour luminance 


and (b) inversely proportional to the 
spread of the inhibition, m. 


In the treatment below, we use the 
convention that the inducing figure is 
placed at the origin and that the test 
figure is placed a distance, S, awa 
If that is the case, and using a mon 


figure of unit width, we get the follow- 
ing distribution: 


Test-figure position 


Displacement 
S<-—m 0 
—m<S<0 —b 
0< Sem b 


mt&s 0 
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Remembering that the inhibition’s slope 
is negative on the left of the inducing 
figure edge and positive on the right 
side of the inducing edge, this means 
that a repulsion displacement occurs 
on both sides of the inducing figure, 
and that it is always equal to the slope 
of the inhibitory function of the same 
strength, if only the test figure is pre- 
sented within the region from —7 to 
+m. 


Role of Eye Movements 


A subject's eyes move even though 
a fixation point is provided, and he 
makes his best effort to fixate steadily 
(Ratliff & Riggs, 1950). The sub- 
ject's eyes move during the inspection- 
to-test-figure interval, drifting away in 
all directions from the subject's own 
mean fixation position. These eye 
movements have the effect of changing 
the true interfigural distance at any 
moment. This must mean that when 
the results of a block of trials are com- 
bined for any one interfigural distance 
value, set by E, the results actually 
represent the pooling of a whole distri- 
bution of interfigural distance values. 
This distribution will be very broad 
if the subject drifts a lot from the fixa- 
tion point and will be quite narrow if 
the subject has more precise fixation. 
What then can we expect the subject's 
eye-movement drifts to be? 

Barlow (1952) has taken a measure 
of the error involved in O’s successive 
fixational pauses. The scatter of eye- 
movement positions when the subject 
indicated that he was satisfied with his 
fixation was 6.5 minutes of arc (5.2 
minutes of arc after correction for 
changes in head position). Two other 
direct-measurement techniques yielded 
values ranging from 6.5-5.7 minutes 
before correction. A subjective 
method, in which O estimates the verti- 
cal distance between his fixation point 
and the afterimage of a previous fixa- 


tion point, yielded a standard-deviation 
value of 6.3 minutes of arc. 

We are now in a position to examine 
the figural aftereffect situation. Let 
cœ be the standard deviation of the sub- 
ject's fixation position in the horizontal 
plane. From Barlow's data, o equals 
about 5 or 6 minutes of arc. We con- 
sider the situation in which E sets the 
average interfigural distance at some 
value less than o. If the inducing fig- 
ure has been presented left of the test 
figure, then, during postinspection, the 
test figure will be displaced an amount 
b to the right (repulsed) whenever the 
eye movements bring the test figure 
momentarily right of the edge of the 
afterimage of the inducing figure (but 
not more than ;z: away from I). A 
definite probability can be attached to 
the event that an eye movement mo- 
mentarily brings the test figure left of 
the afterimage edge (but not more than 
i1 away from that edge), and then the 
displacement will be a repulsion an 
amount b, but in a leftward direction. 
Thus, the momentary displacement is 
not determined, as in previous theories, 
but is probabilistic, depending on the 
errorlike vagaries of fixation scatter. 
For an alignment response, which nor- 
mally requires multiple fixations, the 
expected displacement would be the 
sum of the products of displacements at 
various ¢-figure positions each multi- 
plied by the probability of the ¢ figure 
assuming such a position, Assuming 
the distribution of fixation positions to 
be normal, with a standard deviation 
defined as e, we get as the equation 
of expected displacement’, Ds: 


? For computation, it is convenient to use 
the following transformation: 
m-s 


S — 
Dg t [2 f Ve edy + i j Vee aiid 
g "T o 2i 
m+S 


= i evar), 
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Pollack 


"x 


N 


Displacement in minutes of orc 


Interfígurol Distance (mins.cf arc) 


Fie. ll. A comparison of the distribution 
of figural aftereffects, as they are predicted 
by the inhibitory model (c=5; m = 4) 
with the results of an experiment (Pollack, 


1958). 


b m | (x—S)? 
Dg= Uf e 2 dy 
oV2r Lio 


o _ (x—S) 
-[ e 2g? d|, [4] 


We have plotted the expected displace- 
ment when m = 4 minutes of arc, and 
v — 5 minutes of arc (after Barlow's 
results) in Figure 11. It compares fav- 
orably with Pollack's (1958) data on 
figural aftereffects which are also shown 
in the same figure. It can be seen that 
the course of the displacements as we 
vary the interfigural distance is pre- 
dicted to a first approximation by the 
theory. The prediction is made on the 
basis of only one completely free 
parameter, the constant L which relates 
the contrast of the inducing figure to 
its inhibitory potential, the parameter 
m being undetermined but bounded by 
empirical data, and the parame 


ter g 
being completely determined, and 
logL 
ĝa, 
m 
where i = 1 if o < S <m, and i = =i GF 


m < S. The bracketed functions are the 
integrals of the error function: 


z 
e dy, 
o 


which has been tabled (United States Na- 
tional Bureau of Standards Computation 
Laboratory, 1954). 


Furthermore, the free parameter L is 
a vertical scale factor and does nct 
otherwise affect the curve. ] 
It might be of interest to note, with 
regard to the parameter m, that Mach 
bands, the band of black near a border 
with white and a band of white near 
a border with black, have their maxi- 
mum at about 2-6 minutes of arc 
(Fiorentini & Radici, 1958) which is 
about the same location as the point of 
maximum displacement. The Mach 
bands are almost certainly an inhibi- 
tion-disinhibition phenomenon (Rat- 
liff & Hartline, 1959). A recent theory 
by von Békésy (1960) of the Mach 
bands, from a schematized summation- 
inhibition neural unit, used a value of r 
(roughly equivalent to m in our 
theory) of 10 minutes of arc. The 
Leibowitz et al. (1953) data would in- 
dicate that the greater proporti 
inhibition is dissipated in the f; 
minutes of arc, 
tween the suggest 
utes of arc from 
and von Békésy st 
utes of arc whic 
present model pr, 
linear model is t 


on of 


rst 10 
The discrepancy be- 


ed values of 10 min- 
the Leibowitz et al. 
udies and the 4 min- 
h is optimal in the 
Obably means that a 

© that extent inade- 
quate. Although an exponential decay 
function of inhibition fits the empirical 
data on inhibition and figural after- 
effect better, the linear model has the 
advantage of simplicity. However, in 
the absence of more extensive data on 
the  displacement-distance function, 
more precision does not seem war- 
ranted at present. Moreover, it is our 
Primary interest to Show how much 
of the variance can be accounted for 
by the Simplest model. 


Effect of an Inducing Figure on the 
ance of Alignment Judgments 

The inhibition model can generate 
independent and new prediction 

Ch is at variance with the expecta- 

"S based on prevalent theories of 


an 
whi 
tio: 
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aftereffects. The inhibition model 
specified that eye movements some- 
times place the retinal projection of a 
test-figure contour on one side of the 
inducing-figure contour, sometimes on 
the other. This will occur a significant 
proportion of the time when the inter- 
figural distance is less than two or three 
times the standard deviation of the 
€ye-movement distribution, c. The 
displacement of the test-figure contour 
occurs in opposite direction on the two 
sides of the inducing-figure contour. 
Thus, the apparent positions of the test 
line will be more variable when the in- 
terfgural distance is small. This is 
because, at a small interfigural distance, 
the test figure is successively placed 
on both sides of the inducing-figure 
contour and thus becomes prone to 
displacements in opposite directions. 
This in turn will give rise to a bimodal 
distribution of apparent contour posi- 
tions, a distribution that has a higher 
variance than a unimodal distribution. 
At large interfigural distances, the test 
figure will cross the inducing-figure 
boundary only rarely. Thus, it will be 
exposed to displacements predomi- 
nantly in one direction. This, in turn, 
will produce a unimodal distribution 
which would result in a smaller vari- 
ance of alignment judgments. 

We can derive an explicit expression 
for the variance of O's contour align- 
ments solely attributable to the effects 
of the inducing figure. In a control 
run when no inducing figure is shown, 
O's alignments are some direct func- 
tion of the mean of the excitation dis- 
tribution which the test figure arouses, 
X. If E places an inducing figure at a 
particular interfigural distance, S, then 
the expected displacement would be Ds 
(see Equation 4). Thus, O's average 
alignment would be a direct function of 
7+ Ds. If the test figure is presented 
briefly (,1 second or less; a brief 
enough presentation so that eye move- 
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ments would not cause significant dis- 
placements during the time interval) 
and happens to fall right of the induc- 
ing figure's afterimage (but not more 
than 1 minutes away), it would be dis- 
placed an amount b. Then the ap- 
parent position on that trial would be 
given by x +b. Thus, the deviation 
from O's average judgment, attribut- 
able solely to the displacement, would 
on that trial be given by (¥+ b)— (7 
+ Ds), or (b — Ds). Depending on 
where the test figure falls, during that 
brief presentation, we have the follow- 
ing deviations: 


Test-figure position Deviation 
<= im 0 — Ds 
-m {x <0 —b — Ds 
Ocxt£m b — Ds 
m<x 0 — Ds 


The expected variance, for a specific 
interfigural distance, S, is then given 
by the product of each of these devia- 
tions squared and their probabilities : 


(x-5)? 
T z dx 


1 o 
Vs=(6+Ds)? f e 


(x-5)? 


1 " x 
+0-Ds = | e 2 dx. [5] 
oN2T Jo 


If we let 


b — —Y 
po= f e 


then Equation 4 becomes 
Ds = b(P," — Pn’). 


Substituting into Equation 5 and 
collecting terms, we get 


Vs = (4d Ds) Pa”. [6] 


Substituting the constants we used 
above to compute Ds into Equation 6, 
we obtain a function which is plotted 
in Figure 12. The figure shows a 
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Fic. 12. The standard deviation of vernier 
alignment judgments (control value) and of 
vernier alignments when one of the lines is in 
the vicinity of an inducing figure, (As the 
distance separating inducing from test figure 
is made smaller, the standard deviation is 
seen to increase, especially in the successive 
presentation. Predicted values, derived from 
the inhibitory model, using the same param- 
eter values as in Figure 11, are also shown.) 


thoretical function which peaks some- 
where at a very small interfigural dis- 
tance, the value at an interfigural 
distance of 0 being indeterminate, and 
then drops exponentially reaching a 
value which is approximately equal to 
0 at about 30 minutes of arc. T here 
are, of course, other sources of variance 
in O's judgments. These other sources 
can be estimated from alignment judg- 
ments performed in the absence of in- 
ducing figures, The different variance 
sources will then simply add in the 
figural aftereffect situation, 

Since we are interested in the varia- 
bility due to the scatter of momentary 
test-figure placements, we want to 
compare our theoretical function to 
data obtained in experiments in which 
the test figure has been Presented very 
briefly. If the test figure is presented 
for a longer time, then O can sample 
a number of eye-movement Positions, 
and thus the variability of test-figure 
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positions due to eye movements will 
be reduced as the square root of the, 
number of samples. This follows since 
a judgment would then be based effec- 
tively on the mean number of positions. 
Thus, the mean would represent the 
standard deviation of a sample mean, 
that is, the standard error. The results 
of an experiment (Ganz, 1964) in 
which the test figure was presented for 
.12 second is shown in Figure 12. 
The descending character of the stand- 
ard deviation in the successive presen- 
tations as S is increased is 


quite 
evident. A second study by Ganz 
and Day (in press) utilized a method 


of limits in which each stimulus presen- 
tation lasted .3 second, The results 
are shown in Figure 13, The standard 
deviation again is seen to fall as S is 
increased. In absolute level, the stand- 
ard deviation is much smaller, but if 


we remember that each exposure was 
about 3 times a 


S long and that each 
threshold was determined by 4 runs 
(ABBA in order), then it would ap- 
Pear that we should multiply these 
values b 


1 y the square root of 12 to ob- 
tain 


& comparable estimate of the 
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Fic. 13. The standard deviation of vernier 
alignment judgments at various distances 
from the location of a previously presented 
inducing figure. (A signifies vernier judg- 
ments obtained with a method of average 
error where the test figure is presented for 


an undetermined duration; L signifies the 
use of a method of limits, with a .3-second 
test-figure exposure. Compare with Figure 

; Where shorter exposures were used.) 


——- 


MECHANISM Or FiGURAL AFTEREFFECTS 


standard deviation on single brief ex- 
posures. When this is done, the height 
of the theoretical curve and of the 
curve from the study by Ganz and Day 
become quite comparable (the results 
depicted have not been so adjusted). 
Finally, the data of Pollack (1958) and 
of Ganz and Day (in press), both 
studies using a method of adjustments, 
are shown in the same figure. Here 
the method of adjustment was used, 
and in these cases it is not possible to 
estimate the duration of the test-figure 
exposure period. ^ Notice, however, 
that both studies yielded descending 
functions as the parameter S increased. 
Also, both functions are slightly lower 
in magnitude than the function obtained 
with the method of limits. This is un- 
derstandable when it is realized that 
the method of adjustment probably 
gives the subject the longest stimulus 
exposure of all these conditions. 

It is felt that the prediction of in- 
creased standard deviation of alignment 
near the inducing-figure contour border 
is specific to an inhibitory model such 
as the one presented here. It does not 
follow from the other theories of the 
figural aftereffect (Köhler & Wallach, 
1944; Osgood & Heyer, 1955) where 
displacements are held to be very 
small near the inducing-figure border. 
Thus, the oscillations of the test figure 
on both sides of the contour of the 
inducing figure when the interfigural 
distance is very small would not lead to 
greater variations in alignment. These 
theories predict greater variability of 
alignment where the slope of the dis- 
placement function is steepest. This 
occurs at least at two points, each the 
same distance from the inducing fig- 
ure's edge on either side, and at the 
point of maximum displacement. Put 
differently, at O interfigural distance, 
the standard deviation of the displace- 
ment should be about the same as at a 
very large interfigural distance since in 
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both cases there is very little displace- 
ment. Butas Figures 12 and 13 show, 
this is in fact not the case. Only a 
theory in which there is strong dis- 
placement near the interfigural dis- 
tance of zero, as in the present model, 
could predict the type of descending 
standard-deviation function we have 
found. 

Notice that our equation for Ds im- 
plies that the paradoxial distance effect 
is not inherent in the displacement phe- 
nomenon. It evolves from intrinsic 
errors that arise from the oscillation in 
the actual momentary test-figure posi- 
tion. These are caused by eye move- 
ments and other errors of fixation, or 
even possibly from errors in E’s posi- 
tioning of the test figure from trial to 
trial The latter seems a reasonable 
possibility since 1 or 2 minutes of arc 
are frequently close to the limits of 
error of the apparatus employed (Ganz, 
1964; Kóhler & Wallach, 1944; Pol- 
lack, 1958). 

In Taylor’s (1962) theory the dis- 
placements occur because the subject’s 
alignment judgments are better in the 
presence of the inducing figure. Tay- 
lor feels that the inducing figure acts 
as an anchor, and this effect is stronger 
when the interfigural distance is small. 
It is unlikely, however, that more ac- 
curate judgments should come to be 
associated with greater variance. 

An observation was often made, es- 
pecially under conditions of very care- 
ful fixation (Ganz, 1964), that during 
the postexposure period the apparent 
position of the test figure changed while 
O observed it, often from one side of 
the inducing figure’s edge to the other 
side. This is not accounted for ex- 
cept in a model that includes the 
effects of sudden corrective eye-move- 
ment saccades which might bring the 
test figure to the other side of the in- 
spection contour during the observation 


period. 
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Fic. 14. The effect of varying the scatter 
of horizontal eye movements (c) on the 
distribution of figural aftereffects, as derived 
from the inhibitory model. 


If our theory is correct, the distribu- 
tion of displacements should change if 
the scatter of eye movements is altered. 
One experimental way of accomplish- 
ing this is to stabilize the retinal image. 
The result of such stabilization should 
be to restrict the variance increases of 
the alignment judgments largely to the 
region in the immediate vicinity of the 
inducing-figure edge. This would be 
the result predicted from Equation 6 if 
the parameter c is reduced in value. 
This is shown in Figure 14. An ex- 
periment on image stabilization (Ganz, 
1964) confirmed this expectation. The 
results of this experiment are given in 
Figure 12: successive presentation, sta- 
bilized. 

We should also consider the case 
where, although eye movements are not 
reduced, the test figure does not vary 
in its position with respect to the induc- 
ing figure. This would be the case 

when the inducing and test figure are 
presented together. In such a situa- 
tion, the increase in alignment varia- 
bility at small interfigural distances 
should be absent. In Figure 12 the 
data obtained under the simultaneous 
presentation show only very small vari- 
ance increments. To this extent our 
expectations appear to be confirmed. 


Figural Aftereffects as Simultaneous 
Illusions 


The theoretical model presented 
here considers the displacements which 
Occur in the figural aftereffect situa- 
tion to be a consequence of a simultane- 
ous illusion—mutual contour inhibition 
—brought about by the decaying trace 
of the afterimage upon the test contour. 
We have shown that a theoretical 
model of mutual contour inhibition 
does predict displacements of appropri- 
ate location and variability. One 
would like to know whether such dis- 
placements occur when an actual in- 
ducing-stimulus Object is substituted 
for the postulated afterimage. In 
other words, do simultaneous illusions 
Occur which are similar in direction 


and magnitude to the figural after- 
effect? 


ce and direction 


ner, 1960). In 
depicted the figu 
tions which the 


, the standard deviation 
9n during fixation, is 


eper, and its magnitude 
In Figure 4 we see the 
results of an experiment with 30 sub- 
Jects in which the inducing and test 
figures were Presented simultaneously 
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(Ganz, 1964). Two successive series 
of measurements are shown. Pollack’s 
(1958) experiment in which similar 
figures were presented in the aftereffect 
paradigm is also included in the same 
figure for comparison. We see that 
the simultaneous experiment did yield 
a point of maximum displacement 
nearer to the inducing-figure edge (1 
minute of arc) than the value obtained 
from Pollack’s data (about 4.5 minutes 
of arc) or to any figural aftereffect ex- 
periment known to the author. 

It should be noted, finally, that the 
Inhibition Model predicts that contour 
displacements will occur with or with- 
out fixation tremor. It is only the 
magnitude of the displacements and 
the point of maximal displacement 
which change with such errors of fixa- 
tion. "Thus, the theory is not incon- 
sistent with the fact that figural after- 
effects are found under conditions 
of retinal image stabilization (Ganz, 
1964; Hochberg & Hay, 1956; Kraus- 
kopf, 1960). 


SUMMARY 


The edges of objects in the visual 
field appear to repulse one another 
when they are brought close together. 
These mutual shifts have long been 
known to us in the case of the classical 
geometrical illusions. Such effects can 
be obtained even when a pair of neigh- 
boring contours is presented succes- 
sively. These are known as the figural 
aftereffects. 

A quantitative theory is proposed to 
account for these perceptual aberra- 
tions. The two aspects of the after- 
effect—the shifts in contour position 
and the temporal separation which can 
be introduced between inducing and 
test contour—are treated separately. 
According to the theory, the displace- 
ment arises because the neural corre- 
lates of visual contour inhibit one 
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another. The inhibition is proportional 
to the log of the contour intensity 
(more exactly to its contrast above or 
below background level). The inhibi- 
tion is a decreasing linear function of 
the separation between the contours. 
These two assumptions, both docu- 
mented psychologically and physiologi- 
cally, are sufficient to generate a 
predicted contour displacement. Eye 
movements during fixation, also docu- 
mented, are shown to produce a sta- 
tistical distribution of repulsions at 
various intercontour distances. By 
introducing empirically derived values 
for the equation parameters, a distri- 
bution of figural aftereffects is gen- 
erated which agrees well with experi- 
mental values. 

Two independent sets of predictions 
are generated by the model. First, 
the model predicts a very steep rise 
in response variability as the test con- 
tour is brought closer to the inducing 
contour (particularly when the two 
are presented successively). Second, 
certain changes in the mode of presen- 
tation—retinally stabilized, simultane- 
ous presentation, short test-figure 
presentation—are predicted to yield 
different distributions of figural after- 
effects. The data available to date 
corroborate the predictions. 

The mechanism which bridges the 
time gap is described only briefly. We 
hope to consider it in more detail in 
a subsequent communication. The fig- 
ural aftereffects are, it is believed, 
simultaneous illusions which are in 
many ways no different from the clas- 
sical geometrical illusions. When the 
inducing figure is removed, an after- 
image of the inducing figure remains 
in O's visual field. The neural corre- 
lates of that afterimage then repulse 
the test figure’s neural correlate, in 
accordance with the displacement 
mechanism just described. The lines 
of evidence which support an after- 
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i explanation are strongly sug- 
me m hanhi perhaps not conclusive. 
That is, it is found that the onset, the 
initial intensity, and the decay of fig- 
ural aftereffects and afterimages are 
strikingly similar, often involving the 
same parameter values. They, in turn, 
both appear to reflect the operation of 
light- and dark-adaptation mechanisms. 
Thus, it appears possible that most 
of the quantitative properties of the 
figural aftereffects will be reduced to 
the operaton of a number of basic 
visual mechanisms : adaptation, inhibi- 
tion, and ocular tremor. 
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Current theories of sexual motivat 


ion emphasize the hormonal determi- 


nants of animal motivation and the experiential determinants of human 


motivation. The present theory s 
Sexual motivation are controlled 
determinants. 
ponents, sexual arousal, which re: 


uggests that both human and animal 
by both biological and experiential 


Sexual motivation is said to be comprised of 2 com- 


presents the current level of sexual 


excitation, and sexual arousability, the propensity for arousal. The 


sexual motivation of an individual 


of the state of these 2 components. 


presence and absence of relevant 


may be represented by a specification 
Arousal is modulated by the 
stimuli. Arousability is determined 


by hormonal state, feedback effects of copulation, and experience. The 


present theory stresses the diffe 


rence between the state of sexual 


motivation and the expression of sexual behavior. 


Students of human behavior usually 
find little relationship between that 
aspect of human sexuality which inter- 
ests them and the sexual behavior of 
animals (e.g, Hardy, 1964). It is 
common, for example, to contrast the 
immediate cessation of mating in fe- 
male rodents following ovariectomy 
with the occasionally reported increase 
in libido of female humans following 
removal of the ovaries. It is also 
Common to contrast the congruence be- 
tween type of gonad and mating pat- 
terns in animals with the absence of 
Congruence between type of gonad and 
gender role displayed by some human 
hermaphrodites, It is uncommon, 
ers for students of human be- 
ha P compare analogous facets of 
yu. animal sexuality. Libido 

pared with lordosis, the reflex 


1The ideas ex, 
research supported by grant grew out of 
from the National nts and fellowships 
ral Institutes of Mental 
Health and Child Health and Hu D 
velopment. Although these See : ia 
greatly influenced by my training and NC 
ation with Frank A. Beach, only I can es 
held responsible for them. J thank J, A 
Deutsch for keeping me out of one or two 
theoretical cul-de-sacs through his thought. 
ful reading and criticism of the manuscript. 


posture assumed by estrous female rats 
and cats when mounted by the male, 
even though libido and lordosis repre- 
sent quite different aspects of sexual- 
ity. Libido is not compared with the 
sexual desire of the female rat as meas- 
ured, for example, by the latency to 
mate when the female is placed with 
a male, or by her willingness to work 
for, or to accept pain to reach, a male. 

The failure to compare analogous 
facets of human and animal sexuality 
seems to arise from three facts. First, 
human sexual motivation is usually 
inferred from subjective reports, while 
animal sexual motivation is usually 
inferred from the reliable observation 
and measurement of discrete compo- 
nents of behavior? Second, students 
of human behavior discuss sexuality in 
motivational terms, such as libido, 
while students of animal behavior dis- 
cuss sexuality in response terms, such 
as intromission latency or frequency. 
Third, the motivational terms used to 
describe human sexuality are often am- 


i based upon 
? Some of our inferences are p 
subjective reports of human sexual behavior. 
Although such reports are often inaccurate, 
they provide the only data we have which 
are related to human sexual motivation. 
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iguous. Libido is rarely defined in 
Fg cade Hardy (1964) uses the 
expression “true sexual desire (lust) 
yet fails to provide an adequate defini- 
tion of this expression. It is certainly 
not clear that libido and lust are 
equivalent terms and could be meas- 
ured by the same scales. 
In the past, a similar problem 
plagued students of animal behavior 
(cf. Kuehn & Beach, 1963). Discrete 
and different responses within the mat- 
ing pattern were not discriminated, and 
all mating responses were used equally 
to infer motivation. However, recent 
research has shown, for example, that 
the mounting behavior of the male rat 
contains three components—mounting 
without intromission, mounting with 
intromission, and mounting with ejacu- 
lation—and that the occurrence of 
each of these responses has a different 
effect upon subsequent mating (Beach 
& Whalen, 1959a, 1959b; Whalen, 
1961). The frequency and timing of 
each of the mounting responses seem 
to be related to sexual motivation in a 
different manner, 
Attempts have been made to solve 
some of these definitional problems. 
Money (1961) attempted to delimit 
the concept of human sexual desire, 
which he considers equivalent to libido, 
when he said, “sexual desire is mani- 
fested in erotic imagery, sensation, re- 
port and action [p. 241]." Although 
we might like a more complete descrip- 
tion of these terms and Statements 
relating these terms to their measure- 
ment, the definition offereq by Money 
is useful since it excludes some aspects 
of sexuality from the concept of sexual 
desire. For example. the loss of ejacu- 
latory ability following gonadectomy 
would not indicate a loss of libido, 
even though the loss of ejaculatory 
ability might result in subsequent de- 
cline in sexual desire. Libido, ejacula- 
tion, gender role, and masturbation are 
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all part of human sexuality, but they 
are not all part of sexual motivation. 


DIMENSIONS or SEXUALITY 


Of primary importance for the study 
of human sexuality is a discrimination 
of the components of sexuality. Sexu- 
ality is not a unitary dimension which 
is determined in any simple way by 
hormones or by experience. In the 
past, several classificatory schemes 
have been proposed for the analysis of 
sexuality. The one which we will 


adopt is similar to that proposed by 
Winoker (1963), 


1. Sexual Identification : 
role, male or female, of an 
Masculinity and femininity are related 


to gender role, but are not exclusively 
determined by it. 


2. Object Choice : 
objects toward wi 


the gender 
individual. 


those persons or 
hich an individual di- 
Tects his sexua] activities, be these 
intercourse, fondling, or fantasy, 

3. Sexual Gratification : the rein- 
forcement, reward, or "pleasure" asso- 
ciated with or caused by sexual ac- 
tivities. 

4. Sexual Arousal : 
level of sexual excitatio 

5. Sexual Arousability : an individ- 
ual’s characteristic rate of approach to 
orgasm as a result of sexual stimula- 
tion. It must be emphasized that 
sexual arousability is a rate and not 
a state function. 

6. Sexual Activity: sexual behavior 
exemplified by reported fantasy 
desire as well as by observed or 
ported overt behavior. 


In the discussion which follows we 


will be concerned primarily with sexual 
arousal 3 


and arousabilit at 
these tw. es ais 


© dimensions of sexuality com- 
prise sexual motivation. 


Unfortunately, 
like other motiva 
not be measure 


the momenta ry 
n. 


and 
re- 


Sexual motivation is 
tional States—it can- 
d directly, Sexual 
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arousal and arousability must be in- 
ferred from behavior. This fact raises 
a major problem. Behavior is not 
simply a function of motivational state ; 
behavior is a function of some inter- 
action between motivation and habit 
(Wittenborn, 1957). In species such 
as the rat where habits usually con- 
tribute little to sexual behavior ( Rabe- 
deau & Whalen, 1959), measures of 
the behavior may allow relatively direct 
inferences about motivation. In carni- 
vores and primates, habits greatly in- 
fluence the expression of sexual be- 
havior (Harlow, 1962; Rosenblatt & 
Aronson, 1958; Whalen, 1963a). In 
such cases the validity of our inferences 
about sexual motivation is reduced. 
Because of this difficulty we must seek 
measures of behavior which are either 
little influenced by habits or experi- 
ence, or we must study behavior after 
habits are firmly established. In short, 
we feel that while behavior reflects 
motivation, the course of behavior is 
not part of the motivation. This ap- 
proach to sexual motivation differs 
oo that of Hardy (1964) who feels 
at: 


For an expectation to be motivational in 
character, it must involye not only affective 
change, but also a course of behavior, sym- 
bolic or overt, which is instrumental to at- 
taining the desired affect [p. 16]. 


: SExvuaL MOTIVATION 
rousal and Arousability 


The ex 1 
" pression **. ivati d 
is co 1 sexual motivation 


: y thought of in tw 

only tt o ways. 
First, am individual is said to be mat 
ally motiva dh 


ted at s 

4 $ ome particular time. 

This transient component of sexual 

motivation, which we have termed 
rme 


ry arousal, represents the current 
evel of sexual excitement. Arousal 
may range from no excitement to a 
maximal level of excitement. Maxi- 
mal arousal is characterized by he 
occurrence of orgasm. Arousal may 


be measured by the probability of re- 
sponse, latency of response, frequency 
of response per unit time, and so forth. 
The second facet of sexual motiva- 
tion we have termed sexual arousa- 
bility. Arousability is defined as the 
rate at which an individual approaches 
maximal sexual arousal. Arousability 
is determined not only by the amount 
of increment in arousal which is pro- 
duced by each individual sexual stimu- 
lus, but also by the absolute number 
of sexual stimuli which simultaneously 
arouse the individual, and the number 
and effectiveness of stimuli which are 
negatively arousing (or sexually in- 
hibiting). It should be clear that a 
person characterized as highly arousa- 
ble may not become aroused at some 
particular time because of the absence 
of positive stimuli or the presence of 
negative stimuli. Net arousability may 
be measured by the number of stimuli 
which are needed to produce orgasm. 
Figure 1 illustrates the effects of 
applying a single sexual stimulus to 
individuals of high and low arousa- 
bility. Each stimulus produces an im- 
mediate increment in arousal which 
reaches a peak and then decays in time. 
If a second stimulus is applied before 
the effects of the first stimulus have 
completely decayed, sexual arousal will 


ORGASM 


AROUSAL 


HIGH AROUSABILITY 


TIME 
Fic. 1. Increments in and decay of arousal 
produced in individuals of high and low 
arousability by à single sexual stimulus. 
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AROUSAL 


LOW AROUSABILITY 


SUCCESSIVE STIMULI 


Fic. 2. The growth of arousal in individ- 
uals of high, medium, and low arousability 
produced by a series of consecutive sexual 
stimuli. (In two instances arousal reaches 
its maximum level and Orgasm occurs.) 


progressively increase. Arousal would 
accumulate more rapidly for the high- 
arousable individual than for the low- 
arousable, both because each stimulus 
would add relatively more excitation 
and because the decay of arousal would 
require more time. 

Figure 2 illustrates the smoothed 
growth of arousal as a function of the 
number of successive stimuli for indi- 
viduals of high, medium, and low 
arousability. In this figure the indi- 
viduals of high and medium arousa- 
bility would reach orgasm, but at dif- 
ferent rates, while the individual of low 
arousability would approach, but never 

reach, maximal arousal. For this last 
individual, each stimulus would add 
very little to arousal, and since arousal 
produced by a stimulus would begin 
to decay immediately, stimuli could not 
follow one another rapidly enough for 
the accumulated arousal to reach maxi- 
mum. 

That the effects of Successive stimuli 
do accumulate, do lead to Orgasm, and 
do decay with time has been suggested 
by work with lower animals (Beach 
& Whalen, 1959a, 1959p; Bermant, 
1964; Larsson, 1956). Beach and 
Whalen (1959b) showed that for the 
male rat successive intromissions add 


approximately equal increments to the 
excitation needed to produce ejacu!- 
tion, and that as the delay between one 
intromission and the next increases, 
the amount of excitation decreases. If 
intromissions are spaced far enough 
apart in time, ejaculation never occurs 
(Larsson, 1956). These effects of 
spacing intromissions may be thought 
to reflect changes in arousal illustrated 
in Figures 1 and 2. Observed indi- 
vidual differences in the number of 
intromissions needed to produce ejacu- 
lation may be thought to reflect indi- 
vidual differences in Sexual arousa- 
bility. 

We are stating that, given a constant 
habit level, sexual arousal and arousa- 
bility may be estimated from measures 
of sexual behavior. Tt should be clear 
that our measures of arousal and 
arousability are not entirely independ- 
ent. If arousal leve] is not known, 

cannot be estimated accu- 


1 example, a high] 
individual may reach org ice 


little sexual stimulation, If one did 


d 1 Tgasm with little 
sexual stimulation, In Practice, arousal 


may be discriminated from arousabil- 
ity, and one must take care to do so, 


DETERMINANTS OF ARousar 
AND ÁROUSARILITy 
Arousal 


For an individual of 
level of arousability, į 


stimuli, A 
„9f stimuli can induce 
igh- as compared with 
le individual, 


arousal in the h 
the low-arousah 
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Sexual arousal is influenced by stimuli 
in all sensory modalities—vision, touch, 
hearing, etc. No one modality is criti- 
cal for arousal (Ford & Beach, 1951). 
Furthermore, individuals become con- 
ditioned with respect to which stimuli 
are particularly effective in inducing 
arousal. Thus, while nearly everyone 
is responsive to tactile stimulation, only 
some individuals are responsive to 
erotic visual stimuli, and some indi- 
viduals to only some forms of visual 
stimulation. These multimodality and 
conditioned individual difference fac- 
tors prevent predictions about effective 
arousing stimuli in any particular in- 
Stance. 


Arousability 


_ As with arousal, sexual arousability 
is determined by learning and by the 
physiological state of the organism. 
The physiological determinants of 
arousability seem to be of two kinds. 
First are the hormonal determinants of 
arousability, 

The prevalent notion today is that 
humans differ from other animals in 
their dependence upon hormones for 
their sexual behavior, Tt is accepted 
that castration has little effect upon hu- 
man sexual motivation. Particular in- 
dividuals are said to report either 
Increased or decreased libido following 
gonadectomy, However, there is evi- 


dence whi 
ch i ; : 
Slits that contradicts this notion and 


sult in a 


; ctivit 
cent studies confirm thi? Dad 
sion. 


For example, Breme 

pleted an extensive statis ae $0 aale 

who had been legally castrated in Ne » 

way. Of the 157 cases on whom Mr 

Was sufficient information, 74 reported 
asexualization” shortly after castra- 
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tion, and 29 others reported “asexual- 
ization" within the first year after 
operation. Only 54 individuals re- 
ported some sexual interest and ac- 
tivity after the first postoperative year. 
Bremer concluded: 


It can be stated at the beginning that in all 
cases without exception the amount of sexual 
activity has been altered. It has been re- 
duced or abolished irrespective of the (pre- 
castrational) direction or the form of the 
sexual urge . . . [p. 67]. 


Since many of these individuals con- 
tinued to be exposed to sexual stimuli 
yet failed to become aroused, the re- 
ported reductions in sexual activity 
may be assumed to reflect reductions in 
sexual arousability. 

Studies of hypogonadal men under 
testosterone therapy also support the 
hypothesis that the male hormones, 
androgens, exert a major influence 
upon sexual arousability. Money 
(1961) after an examination of 11 
such cases concluded that: "In each 
case, the absence of androgen from 
the tissues made a decided difference to 
sexual desire and sexual performance 
[p. 243]. A reduction in the fre- 
quency of erotic dreams and imagery as 
well as a reduction in the frequency of 
the initiation of intercourse was noted. 
Castration and hypogonadalism, there- 
fore, reduce the expression of sexual 
behavior. We assume, tentatively, that 
these reductions reflect changes 1n 
sexual arousability. Additional data 
are needed for this assumption to be 
verified. 


Of course, one wo 
conclude, on the basis of these data, that 


the expression of sexual behavior 1s 
solely under the control of androgenic 
hormones. As noted earlier, experi- 
ence and habits do influence the ey 
pression of sexual behavior. Pr y 
experienced male cats (Rosenb! a 

Aronson, 1958) continue to bs or 
several months following gonadectomy, 


uld not wish to 
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and castrated and hypogonadal men 
continue to have erotic sensations, de- 
sires, and fantasies, and do masturbate 
and occasionally have intercourse ( Bre- 
mer, 1959; Money, 1961). Habits, in 
the absence of hormones, can mediate 
some sexual expression. The point we 
wish to stress is that in the absence of 
androgenic hormones the sexual arous- 
ability of men, as of other animals, is 
reduced. 

In female organisms the problem of 
the hormonal contribution to sexual 
arousability is more complex than in 
males. The simple assumption that 
estrogen directly controls arousability 
may not be true. Sexually naive fe- 
male cats primed with estrogen will not 
accept the male, even though they will 
assume the mating posture either spon- 
taneously or in response to perineal 
stimulation (Whalen, 1963a). In this 
case, the primary effect of estrogen is 
on the facilitation of the postural re- 
flexes necessary for mating, rather than 
on arousability per se. This may be 
an example of the effects of estrogen 
on mating in general. Estrogen may 
be crucial for mating only in those spe- 
cies in which unusual postural adjust- 
ments are required for successful mat- 
ting. A dissociation between estrogen 
secretion and mating seems to occur in 
those species in which unius 
are not required for mating. By "un. 
usual" postures we mean those pos- 
tures, such as lordosis, which have a 
low probability of occurrence in non. 
mating situations. Female monkeys 
and humans accept the male at times 
other than the time of maximal follicu- 
lar development and, when mating, as- 
sume postures which occur with mod- 
erate to high frequency at times other 

than mating. In these cases, estrogen 
clearly is not directly related to arous- 
ability. . 

Tn all species, estrogen may indi- 

rectly affect arousability through its 


ual postures 
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production of cytological changes in 
the vaginal epithelium and the con- 
comitant facilitation of vaginal lubrica- 
tion. Vaginal lubrication may make 
intercourse less difficult and painful and 
thereby contribute to arousability. 
The slight and inconsistent reductions 
in sexuality which are reported to fol- 
low ovariectomy (Bremer, 1959; Fuller 
& Drezner, 1944) may be accounted 
for in terms of changes in vaginal state 
rather than by changes in arousability. 

Although sexual arousability may be 
unrelated to estrogen levels in women, 
arousability may still be under hor- 
monal control. It has been suggested 
by a number of investigators that 
androgen is the primary libido hormone 
in women (Money, 1961). Waxen- 
berg, Drellich, and Sutherland (1959) 


found that female cancer patients who 
underwent therapeutic oy. 


ported some decrease in 
activity, and responsiven 
of a reduction in arous 
ever, the reported decr. 
were not nearly as profound as those 
reported following combined ovariec- 
tomy and adrenalectomy, Following 
the combined operation, 82% reported 
a decline in desire, 100% a decline in 
activity, and 92% a decline in re- 
sponsiveness. The authors suggest 
that these declines in sexuality were 
the result of the removal of the an- 
drogens which are produced by the 
adrenal glands. Although these find- 
ings are suggestive, one must 


ariectomy re- 
sexual desire, 
ess suggestive 
ability. How- 
eases in libido 


worry 
about the validity of reports from 
women with cancer, castration, and 


adrenalectomy. 


Studies of the effects of administered 
testosterone may be more valid. Foss 
(1951) examined 10 women who were 
being treated with 200-300 milligrams 
of testosterone daily. Five reported an 
increase in libido, or sexual craving, 
two others reported an increased fre- 
quency of intercourse, three noted only 
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an increase in the size of the clitoris, 
and only one individual reported no 
effect of the treatment. Similar data, 
obtained by Salmon and Geist (1943) 
from a study of 101 cases, support the 
notion that androgen, rather than estro- 
gen, determines sexual arousability in 
women. We are assuming in these 
cases that the androgen determines 
the responsiveness to erotic stimuli. 
In the absence of androgenic stimula- 
tion, arousability would be low, and 
stimuli would not act to increase 
arousal. 

It should be noted that the locus of 
the effects of androgen is still unclear. 
While androgen may be directly in- 
fluencing arousability by its action on 
critical neural centers, androgen may 
also be influencing arousability by its 
peripheral action. Androgen does 
cause growth of the clitoris (Foss, 
1951; Groome, 1939), and clitoral 
stimulation does lead to orgasm (Mas- 
ters, 1960). It is entirely possible 
that the reported increase in libido fol- 
lowing testosterone is the result of in- 
creased clitoral sensitivity and the re- 
sulting facilitation of orgasm (Carter, 
Cohen, & Shorr, 1947). Kupperman 
and , Studdiford (1953) concur with 
this interpretation, but also suggest that 
the androgen works by increasing gen- 
n "well being." Money (1961) 
e fiat androgen has both a direct 
Ted wh libido and an effect medi- 
clitoris. ncreased sensitivity of the 

Re 
aide Cao the mode of action of 
cate that horses 2 the data do indi- 

um es affect sexual s- 
ability. TI Hal aron 

y he false assumpti that 
estrogen must be the critical hot 2 
underlying arousability may eem 
investigators (e.g, Hardy, 1964) a 
the conclusion that hormones are in 
relevant to human sexuality. a 

Unfortunately, data are not avail- 
able which allow us to make statements 
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about the precise physiological relation- 
ships between hormones and arousabil- 
ity. The data which do exist, however, 
are consistent with the notion that 
hormones affect arousability by altering 
the threshold for erotic stimulation, re- 
gardless of whether the threshold in 
question is one of peripheral tissue 
sensitivity or of central neural sensi- 
tivity. It is also reasonable to assume 
that the hormones act centrally by de- 
termining the amount of change in 
arousal produced by a given stimulus, 
and act peripherally by determining the 
amount of receptor response to a stim- 
ulus, and/or the rate of adaptation of 
the receptor. Additional data are 
needed to verify these points. 

A second physiological determinant 
of arousability, one which is seldom 
discussed, is the feedback effect of sex- 
ual activity upon subsequent arousabil- 
ity. Extensive analyses of rat and cat 
mating indicate that the performance 
of sexual responses reduces arousabil- 
ity. This reduction in arousability oc- 
curs in two ways. For the male rat, 
each ejaculation seems to inhibit arous- 
ability, and several ejaculations lead to 
the cessation of mating (Beach & 
Jordan, 1956b). For the male cat, the 
feedback effects of ejaculation seem to 
be minimal. Reductions in arousability 
do accompany mating in this species, 
but these reductions seem to result from 
some process of stimulus adaptation 
(Whalen, 1963b). Inthe human male, 
ejaculation or orgasm seems to affect 
arousability as it does in the rat. Mas- 
ters and Johnson (1963) have shown 
that the human male is resistant to 
sexual stimulation immediately after 
ejaculation. Continued postorgasmic 
stimulation slows, but does not prevent, 


the detumescence process. 
It is important to note that at orgasm 


and the cessation of stimulation, arousal 
and arousability do not decay at the 
same rate, It is likely that, for the 
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INTENSITY 


ORGASM x 
TIME 


Fic. 3. Changes in arousal and arous- 
ability as a function of sexual stimulation and 
orgasm. (See text for full description.) 


male, arousability immediately falls to 
zero following orgasm, while arousal 
decays slowly. These changes are il- 
lustrated in Figure 3. 

At Time 0 sexual stimulation begins, 
Arousability is high, and arousal 
rapidly increases until orgasm occurs, 
At orgasm, arousability drops to zero 
for a short time and then slowly in- 
Creases. After orgasm, arousal slowly 
declines if sexual stimulation is discon- 
tinued. At Time X after orgasm arous- 
ability would be lower than before or- 
gasm but would be above zero, while 
arousal would still be relatively high. 
If the individual were restimulated at 
Time X, his rate of approach to a sec- 


from arousability. 
the condition describe 
hold for the male rat (Beach & Whalen, 
19592). Immediately following ejacu- 
lation, the male rat will not reinitiate 


quite high as measured by intromis- 
sion latency, and the second ejacula- 


tion occurs after fewer intromissions 
than did the first ejaculation, P 

This analysis applies only to the 
male. There is no evidence that or- 
gasm has similar feedback properties 
in the female. Masters (1960) has 
shown that orgasm does not necessarily 
alter arousability in women, Following 
an orgasm a woman may be maintained 
at a high “plateau” level of arousal 
and may be restimulated to a second 
orgasm in a short time. It is not 
known, however, whether the second 
orgasm is reached with more or less 
stimulation than the first, 


orgasm that chang 
are obscured, 


by stimulus- 
adaptation Processes. This position is 


in direct Contrast to that of 
(1964) who feels that: T 


While sexual arou 
a physiological basis 
enced by experience in the sense that 
Sexual arousal comes to Occur to an 
increasing number of stimuli with ex- 
perience, As pointed out by Hardy 
(1964), the adolescent is Jess likely 
to be aroused by movie love scenes 

3n i$ an adult who has experienced 
n in the ways de- 
Olescent matures and 
al gratification in a 
ons, new stimuli come 
ly arousing properties, 


sability does have 
» It is also influ- 


: As the ad 
experiences sexu 


Variety of situati 
to take on sexual 
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When the individual later returns to 
these situations, more stimuli are effec- 
tive in producing arousal than had been 
the case before conditioning, and the 
individual reaches orgasm more rap- 
idly. The arousability of the individ- 
ual would, therefore, be greater, the 
greater the number of stimuli which 
could be simultaneously effective. It 
should be remembered, of course, that 
experience may also attach negative 
values to stimuli such that some stim- 
uli may inhibit rather than increase 
arousal. Arousability would be low- 
ered in such situations. 


SEXUAL GRATIFICATION 


We are proposing that arousability 
is conditionable. If this is true, how 
is this conditioning effected? It is 
likely that stimulation of the erogenous 
zones is inherently reinforcing, and 
that neutral stimuli become conditioned 
arousers through their association with 
stimulation of the erogenous zones. 
The primary mode of reinforcement 
would be through genital stimulation as 
produced in the male by phallic contact 
or possibly as produced spontaneously 
during erection. That penile stimula- 
tion is inherently reinforcing has been 
suggested by work with lower animals 
(Beach & Jordan, 1956a; Bermant, 
1961; Kagan, 1955; Whalen, 1961). 
That orgasm or ejaculation is not 
[red for reinforcement to occur 

ue been demonstrated (Sheffield, 
1961). Backer, 1951; Whalen, 
talon ane clitoral and breast stimu- 

„r, Presumably reinforcing, but 
there is little evidence 5 ig, Du 

] o confirm this 
point. The female rat will | 
instrumental response in b pa an 
with a male (Bermant, 1961) peres 
suggests that some component = 
ad pattern is inherently reinforc- 
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It is clear that the stimulation pro- 
vided by orgasm is more reinforcing 
than that provided by intromission or 
clitoral contact. Women who achieve 
orgasm by masturbation are more likely 
to continue the practice of masturbation 
than are women who do not achieve 
orgasm in this way (Kinsey, Pomeroy, 
Martin, & Gebhard, 1953). Male rats 
are more likely to perform an instru- 
mental response to be with a female if 
they are allowed to ejaculate in the goal 
box than if only intromission is per- 
mitted (Kagan, 1955). These studies, 
however, do not show that orgasm has 
any unique reinforcing properties. It 
may be that the reinforcement charac- 
teristics of orgasm reflect only the 
summated reinforcement properties of 
the stimulation which is needed to 


produce orgasm. 


SEXUAL IDENTIFICATION AND 
OBJECT CHOICE 


The present paper has attempted to 
specify some of the biological and psy- 
chological influences upon human and 
animal sexual motivation. Students of 
human behavior, however, may find 
these statements irrelevant to what they 
consider the central problem of human 
sexual motivation, namely, sexual iden- 
tification (or masculinity and feminin- 
ity) and object choice, the selection of 
the persons or objects toward which 
sexual activities are directed. It is 
necessary, therefore, to relate these di- 
mensions of sexuality to the motiva- 
tional concepts which have been dis- 
cussed. 


Sexual Identification 


Money, Hampson, and Hampson 
(1955a, 1955b, 1956), in an excellent 
series of studies, have shown that an 
individual’s concept of his gender, male 
or female, is determined primarily by 
the way in which the individual was 
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reared. Pseudohermaphrodites with 
identical gonads and anomalous genital 
development consider themselves and 
are considered by others either male 
or female depending upon whether they 
were raised as male or female. These 
findings seem to conflict with our 
knowledge of animal sexuality which 
indicates that the development of male 
and female sexual motivation is deter- 
mined by genetic hormonal influences 
(Young, Goy, & Phoenix, 1964). This 
conflict arises because it is thought that 
human sexual identification allows as 
reasonable inferences about sexual mo- 
tivation as does animal sexual behavior. 
However, “sexual identification” and 
“gender role” are psycho- and socio- 
sexual concepts denoting socially de- 
fined, sexually dimorphic (or more 
properly, sexually dipsychic) behaviors, 
Gender-specific behaviors vary widely 
from culture to culture and include ac- 
tivities such as working, cooking, hunt- 
ing, and house cleaning, as well as the 
initiation of sexual activities (cf. Mead, 
1961). The gender-role concept does 
not necessarily include the concepts of 
sexual motivation, arousal, or arousa- 
bility. There is no evidence to indicate 
that sexual motivation is inherently 
dimorphic and characteristic of one 
gender or the other, Of course, the 
specific forms taken by sexual behavior 
are affected by the sexual identification 
of the individual. For example, the 
propensity for arousal by visual stimuli 
may be considered one component of 
the male gender role (Kinsey et al, 
1953). However, one would not want 
to conclude from data of this nature 
that males and females differ in the 
type or form of erotic arousal in any 
meaningful way. We find it reason- 
able to assume, rather, that sexual 
arousal is similar in males and females, 
although the expression of arousal is 
sexually dimorphic and determined by 


an individual's experience within his 
culture. . 

Furthermore, while the sexual iden- 
tification of an individual determines 
some facets of the expression of sexual 
arousal, not all components of sexual 
arousal are gender specific. It is not 
even clear, for example, that the copu- 
latory responses of men and women 
are entirely distinctive, Certainly the 
stereotyped postural responses charac- 
teristic of nonhuman Species are not 
found in humans. The little. experi- 
mental work which has been done on 
human sexual behavior indicates that 
one aspect of the copulatory response 
pattern, orgasm, is essentially equiva- 
lent in males and females. ^ Masters 
and Johnson (1960, 1963) have shown 
that the vaso-congestive phenomena 
and the pelvic-genital contraction phe- 
nomena which constitute orgasm are 
similar in males and females. Mechan- 
ical and fluid-expulsion differences are, 
of course, present, but these should not 
cloud the basic anatomic and physio- 
logic similarities which are present, 
These similarities should be considered 
particularly striking since orgasmic 
Processes have been considered quite 


different in males and females for so 
long. 


role or 


Ty widely from culture 
3 culture and from age to age, and, 
therefore, cannot result from biological 
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(hormonal) influences. Hampson and 
Hampson (1961) also dismiss the hor- 
monal determination of object choice 
because they have repeatedly found in- 
dividuals of similar hormonal status 
who direct their sexual activities to 
males or to females in a way which re- 
flects their sex of rearing rather than 
their hormonal sex. These investiga- 
tors conclude from these data that 
learning and experience, rather than 
hormones, determine the choice of sex- 
ual object. The implication from this 
conclusion is that learned object choice 
is part of the sexual motivation. This 
implication is unwarranted. There is 
certainly little evidence, even in the 
animal literature, to indicate that ob- 
ject choice determines sexual motiva- 
tion, or that sexual motivation deter- 
mines object choice. In animals as in 
man, object choice may be determined 
by experience. Dogs masturbated by 
an experimenter will come to direct 
their sexual behavior toward that ex- 
perimenter (Beach, 1950), and male 
cats may be readily trained to mount 
other males or even toys (Michael, 
1961). In these cases one would not 
say that motivation has been altered by 
the training experiences. 

We feel that the object chosen is 
learned and represents one facet of the 
habit component of the expression of 
sexual behavior. This distinction 1s 
similar to the one drawn by Freud 


(1962). 


APPROACHES TO SEXUAL MOTIVATION 


At several points in this discussion, 


the present propositions have been 
placed in contrast to those of Hardy 
(1964). These contrasts have been 


drawn in order to highlight our differ- 


ences in theoretical approach to the 
al motivation. 


problem of sexu no 
Hardy has followed one historically 


valid approach to the problem of mo- 


tivation, namely, one which considers 
the direction of behavior an integral 
part of the motivation. In such cases, 
motives are thought to act partially 
through a process of stimulus selection. 
The hungry organism orients toward 
food and not toward a mate. When 
applied to sexuality, such a theory leads 
to the notion that homosexuality and 
heterosexuality represent fundamentally 
different motivational states. The pres- 
ent theory, in contrast, may be charac- 
terized as "energetic" theory or 
“arousal” theory in which the expres- 
sion of the motivation is considered 
independent of the motivational state. 
Homosexual and heterosexual be- 
haviors, in the present theory, are con- 
sidered different habit states which may 
be activated by identical motivational 
states. 

These different theoretical ap- 
proaches to sexual motivation lead to 
quite different relative emphases upon 
experience and the biological basis of 
sexuality. Hardy’s response-directed 
theory finds little biological contribu- 
tion to sexual motivation, since so much 
of the expression of human sexual be- 
havior is determined by experience. 
The present energetic theory, on the 
other hand, emphasizes the biological 
basis of motivation and considers the 
expression of behavior a combined 
function of motivation and habit. 
Whether the response-directed theory 
or the energetic theory will prove more 
effective in eliciting fresh research and 
new theoretical formulations must re- 
main an empirical question. 
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Since the publication of Hebb's The 
Organization of Behavior in 1949, psy- 
chologists have become increasingly 
concerned with two closely related 
problems—the definition of cognitive 
Structure and the Specification of a 
mechanism for cognitive change. 

There is great diversity in the defini- 
tion of cognitive Structure. Hebb’s 
emphasis has been on the elaboration 
of perceptual Processes as a general 
model for cognition, Bartlett (1932) 
discussed schemata as the structures 
underlying memory and its distortions, 
while Piaget (1950) used the same 
word to comprehend a very different 
range of phenomena—the development 
of intelligent behavior in children, 
Two more recent statements have ex- 
tended the notion that human thought 
can be conceived as a System of rules, 
Newell, Shaw, and Simon (1958) dis- 
cuss the application of computer tech- 
nology and logic to an understanding 

of human thinking, and Miller, Galan- 


! The research reported here was sup- 
ported in part by a grant from the Carnegie 
Corporation of New York. 
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but most recent proposals have con- 

"tained the notion of discrepancy be- 
tween sensory input and cognitive 
Structure or discrepancy between two 
elements of cognitive structure as de- 
manding cognitive change. The re- 
moval or reduction of a discrepancy 
1s assumed to be responsible for cog- 
nitive change (see Hunt, 1963, for a 
review of several theoretical positions 
on this issue). 
. Even before questions about cogni- 
tive structure and change can be ad- 
dressed, it is necessary to make some 
Statement about the nature of the en- 
vironment from which cognitive struc- 
tures must develop. Some psycholo- 
gists have continued to use the tradi- 
tional division of the environment into 
common objects and, rarely, physical 
quantities under the general term stim- 
ulus with occasional reference to char- 
acteristics of receptor end organs. A 
somewhat more general procedure for 
description of the environment has been 
suggested in the work of Attneave 
(1957) and Garner (1962), among 
others. For these psychologists, the 
environment is seen as a field of varia- 
bility characterized by relative inde- 
pendence or redundancy of events. 
They have used modifications of 
communication theory (Shannon & 
Weaver, 1949) to develop metrics for 
the specification of environmental vari- 
ability. 
_ We have recently reported (Mun- 
singer & Kessen, 1964; Munsinger, 
Kessen, & Kessen, 1964) a series of 
studies on the structuring of environ- 
mental variability. Specifically, we 
asked subjects to state their preferences 
for (a) random shapes that varied 
widely in number of independent ele- 
ments and ( b) sequences of letters and 
words that varied from random to fully 
redundant. Those studies und theme 
tois reported here grew out of omis 
general assumptions. The purpose o 
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the present report is to state these 
governing assumptions briefly, to pre- 
sent four new studies derived from 
them, and, in the light of the results 
of these studies, to indicate required 
modifications and extensions of the 
assumptions. Our initial assumptions 


were four. 
1. Variability. Human beings are 


sensitive to stimulus variability ; this 
variability may be usefully specified in 


the metric of information theory 
(Attneave, 1959). 
2. Limit. There is an upper limit 


on the capacity of human beings to 
process independent environmental 
events (Miller, 1956). 

3. Structuring. Human beings are 
able to break through this limitation on 
their capacity to process independent 
events through the development of 
rules (structure) which reduce the 
effective stimulus variability ( Mandler, 
1962). 

These first three assumptions can be 
interrelated in another way. People 
respond to cognitive uncertainty. In 
the limiting case, the unrealistic one in 
which there is no organizing cognitive 
structure, cognitive uncertainty is a 
simple function of stimulus variability. 
However, even for unsophisticated hu- 
man beings—the newborn infant, at 
the extreme—rules are available which 
permit some coding or processing of 
stimulus variability. Thus we can 
think of cognitive uncertainty as a joint 
function of cognitive structure and 
stimulus variability. 

It should be remembered that, in the 
present studies, we have assumed that 
human beings have a preference for 
a certain level of cognitive uncertainty 
(Munsinger & Kessen, 1964). Cog- 
nitive uncertainty reflects the person's 
ability to impose meaning (cognitive 
structure) on the array of information 
available in his environment (stimulus 
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variability). If stimulus variability 
changes with no change in SH dm 
structure, then preference should shift. 
Similarly, if the subject imposes struc- 
ture on a fixed field of stimulus varia- 
bility, then his preference will change 
also. Certainly preference is not the 
only index of cognitive uncertainty ; for 
this reason we have reported relevant 
evidence on estimation accuracy and 
categorization. The measure of prefer- 
ence, however, has the advantage of 
expressing, in part, the motivational as- 
pects of cognitive uncertainty, If the 
development of structure results in the 
subject’s preference for increasing 
amounts of stimulus variability, we 
have a general model for intellectual 
development. As the Person learns to 
structure variability, he turns toward 
higher levels of variability, 
It is not altogether clea 
are imposed on random shapes to re- 
duce stimulus uncertainty. Subjects 
may categorize the figures in terms of 
previous experience (“They look like 
something.”) or, in the case of the 
art-student subjects, in terms of an im- 
posed dimensionability (jaggedness or 
weight), or in t 
rization, 


r how rules 


the structuring 
ear that it is ac- 


estimation, accuracy 
or preference. 


4. Cognitive C, hange, 


A person will 
be most likely to change : ts 


his cognitive 

? Another interesting example of Change 
in behavior with the assumed development of 
structure is provided by a Study of estima- 
tion accuracy in which two groups of sub- 
jects (N = 120) were asked to look at and 
to try to remember 5-turn figures (Group 
1) or 40-turn figures (Group 2). SAM 
140 presentations at 7 seconds duration, all 
subjects were asked to estimate four levels 
of stimulus variability at 200 milliseconds 
exposure duration. A , Statistically stable 
(Fou = 4.01; p<.01) interaction reflected 
the fact that Group 2 was more accurate in 
estimation of 40-turn figures. 
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Structure if he is presented with a level 
of cognitive uncertainty just beyond 
his present processing ability. If he is 
presented cognitive uncertainty far 
beyond his ability to process or no 
cognitive uncertainty at all, little or no 
change will take place (Piaget, 1952). 


In our first studies derived from 
these assumptions we decided to ex- 
amine human preference for variability. 
In part, this decision grew from our 
Concern with problems of motivation. 
What aspects of the environment would 
claim the attention of à person? What 
parts of the environment would be at- 
tractive to him? As Piaget (1952) 
Says “. . . a subject looks neither at 
what is too familiar, because he is in a 
way surfeited with it, nor at that which 


is too new, because this does not cor- 
Tespond to anythi 


-.. [p.68].” Tl 


of the Structuring of variability, we 


aS we examine 
aspects of the 
here—the limitation 
ity, the developm 
ture, and the rol 
in forming cog! 


9n processing abil- 
ent of cognitive Struc- 
€ of specific experience 
nitive structure, 


Processing Limits 


) has reviewed much of 
rature on the existence 
of an upper limit on the ability of hu- 

beings to receive, process, and 
transmit environmental variability. 
Garner (1962) has summarized more 


In 


: we 
n in the preference. 
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variability function with stimuli of 
about 10 independent turns. Although 
such findings fit well with the earlier 
literature on processing limits and with 
our assumptions, there was a clear need 
for an independent test of the assump- 
tion of the processing limit, Study I 
was designed to provide such a test. 


Study I 


Subjects. Fifty-four undergraduate stu- 
dents in psychology courses at the University 
of Illinois and 108 students in the second, 
third, fifth, and sixth grades of School Dis- 
trict Number 68, Skokie, Illinois, served as 
judges in this study. Data from the second 
and third graders were combined, as were 
data from the fifth and sixth graders. 

Stimulus Materials. Seven instances of 
eight levels of stimulus variability repre- 
sented by random figures of 5, 6, 8, 10, 13, 
20, 31, and 40 independent turns (Attneave 
& Arnoult, 1956; Munsinger & Kessen, 1964) 
were presented to the judges in two ran- 
domized blocks. Each stimulus was shown 
on a projection screen for 4 second; the in- 
terval between presentations was approxi- 
mately 10 seconds. 

Procedure. Judges were shown examples 
of the figures and told the number of cate- 
gories to be judged and the number of turns 
defining each category. They were asked 
to guess the number of turns in the figures 
projected. The procedure was essentially 
that used by Kaufman, Lord, Reese, and 


Volkmann (1949). 


Results 

Figure 1 shows the relation between 
number of independent turns defining 
each category of random shapes and 
judges’ accuracy of estimation. The 


number of correct estimations de- 
creased linearly between figures of 5 
For 


turns and figures of 10 turns. 
figures beyond 10 turns, no further de- 
crease in accuracy occurred (Fr/1,113 
for linear trend = 639.0; Pius for 
quadratic component = 68.6). More 
recent data from our laboratories indi- 
cate that, for relatively brief ex- 

8 The authors are grateful to Wesley Gibbs 
and the staff of Skokie School District Num- 
ber 68 for their support and assistance. 
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NUMBER OF INDEPENDENT TURNS 
Fic. l. Mean number of correct estima- 


tions of number of turns of eight levels of 
random shapes for three age groups. 


posures, there is no interaction between 
duration of exposure and the limits of 
estimation accuracy. There is an ap- 
parent age difference in accuracy 
(Fans = 16.25 € .001) and an inter- 
action (Fus = 83; p< .001) be- 
tween age and stimulus variability. 
The slopes of the functions beyond 10 
turns are parallel from one age group 
to another. 

These findings lend strong inde- 
pendent support to the postulate that 
there is an upper limit on the ability 
of human beings to process variability 
of stimulation. For the random shapes 
this limit is in the range of 8-10 inde- 
pendent turns. This finding is in good 
agreement with previous results which 
indicate an inflection in the preference- 
variability function at about 10 inde- 
pendent turns. 

The relation between estimation ac- 
curacy and stimulus variability in the 
range between 5 and 10 independent 
turns that was found in the present 
study is reflected in the preference data 
of a number of earlier studies. For 
children. from first to eighth grade 
and for adult subjects who have had 
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varying experience with — 
shapes, the preference functions Or 
figures between 5 and 10 independent 
turns are essentially parallel. This is 
not at all the case if we consider the 
stated preference of subjects for fig- 
ures with more-than 10 independent 
turns. The preference function for 
figures between 10 and 40 turns does 
not resemble closely the corresponding 
portion of the estimation-accuracy func- 
tion. Moreover, the preference func- 
tion for figures of high variability 
shows systematic and significant varia- 
tion among groups of subjects who dif- 
fer in age and in experience with ran- 
dom shapes ( Munsinger & Kessen, 
1964). This wide variation in response 
to figures above 10 turns suggests the 
operation of a procedure for processing 
high stimulus variability which is very 
different from the simple perceptual 
one which may be reflected by the re- 
sults of the estimation study. 


Tur DEVELOPMENT or STRUCTURE 


We have examined three sources of 
variation in the handling of figures of 
high variability—long-term experience 
with complex forms, short-term experi- 
ence with the random shapes, and age 
(Munsinger & Kessen, 1964; Mun- 
singer et al., 1964), 


Long-Term Experience with 


Complex 
Forms 


Art students were asked to state 
their preferences for random Shapes 
varying from 5-40 independent turns 
in the expectation that their experience 
with geometrical figures would have 
led to the development of rules for 
structuring high variability. The re- 
sults confirmed the expectation; the 
preference-variability function for the 
art students was generally monotonic 
and positive. It is interesting also 
that the art students found figures of 


few turns "dull, plain, and uninterest- 
nom 


ing. 


Short-Term Experience with Random 
Shapes 


A group of college undergraduates 
was asked to state their preferences 
over and over again for the same fig- 
ures differing in stimulus variability 
under the hypothesis that increasing 
experience with random shapes would 
lead to the development of procedures 
for structuring. Preference for fig- 
ures of high variability was expected to 
shift upward following 2 hours of 
paired-comparison preference judg- 
ments. Again, the results supported 
the conclusion that preference for high 
variability may be changed through de- 
velopment of coding rules. The sub- 
jects showed higher preference for 
figures of many turns toward the end 


of the session than they had at the 
beginning, 


Age 


The changing Preferences of children 
es of high Variability were ex- 
y means of the same procedures 
for determining preference used with 
adults. Children in all grades of an 
elementary School made preference 
judgments of the 5. to 40-turn random 
shapes. The expectation of an age- 
invariant area of inflection of the pref- 
erance-variability function near figures 
of 10 independent turns was confirmed, 
There is surprising similarity in the 
shape of the Preference-variability func- 
tion between 5 and 10 independent 
turns when the Tesponses of human 
beings from ages 6-22 are compared. 


already been Suggested, no 


i y (Le. figures of more 
than 10 turns), We had expected that 
children would have less ability to 


structure figures of high variability and 


-« 
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that they would therefore show lower 
„preference for them. However, the 
children prefer figures of high variabil- 
ity far more than adults do. The de- 
creasing preference for figures of high 
variability with increasing age is regu- 
lar; second graders state lower pref- 
erences for figures of high variability 
than do first graders, third graders 
state lower preferences for figures of 
high variability than do second graders, 
and so on. These unexpected findings 
lead to the studies of developmental 
variation in ability to categorize re- 
ported here. 

Sense may be made of the unpre- 
dicted preference of young children for 
figures of high variability if it is as- 
sumed that young children and adults 
process high variability in the following 
different ways. Young children may 
sample the high-variability figures, at- 
tending only to that part of the figure 
which they can structure; figures of 
many turns provide more opportunity 
for this process of selection than do fig- 
ures of few turns. Thus, young chil- 
dren could handle and would prefer 
figures of high variability. Adults, on 
the other hand, are more likely to 
attend to the entire figure whatever its 
level of variability. This strategy 
places a demand on adults to structure 
figures of many turns, their inability 
to do so leads unsophisticated adults to 
express relative dislike for figures of 
high variability. 

A clear and testable proposition de- 
rives from this assumption. If young 
children attend to only part of the 
figures of many turns, then they should 
exhibit differential difficulty in cate- 
gorization—that is, the formation and 
use of classes based on high stimulus 
variability—when their ability to cate- 
gorize is compared with the ability of 
older children and adults. Moreover, 


younger children should show less im- 
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provement over trials in their ability to 
categorize than older children and 
adults because strategies of categoriza- 
tion based on selection of part of a 
random figure are inappropriate for the 
task of classification. 

Two studies were designed to test 
this prediction about the relation of 
high stimulus variability and age to 
the ability to categorize. Study II re- 
quired subjects to associate nonsense 
names to classes of random shapes of 
four levels of variability; Study III 
required subjects to sort random shapes 
on the basis of their variability. 


Study IT 


Subjects. Twenty-two children from each 
of four different grade levels (second, third, 
fifth, and sixth) in Champaign, Illinois, 
schools and 22 adults from introductory psy- 
chology classes at the University of Illinois 
participated in Study IL* 

Specific Method. Six different examples 
of each of four levels of stimulus variability 
were used. One level contained only random 
figures of 5 independent turns, another level 
only 10-turn figures, a third only 20-turn 
figures, and the fourth level contained only 
40-turn figures. Subjects were told: "Try 
to learn which name goes with which figure. 
There are things about the figures which 
will help you to know which name goes with 
which figure." Subjects were encouraged 
to guess; they wrote their responses on pre- 
pared answer sheets. 

Each figure and its "name" were pro- 
jected for 10 seconds to a front wall of the 
room in which subjects were seated. The 
24 different random shapes were presented 
with order randomized within balanced 
quarters on each of 6 runs. On the first, 
third, and fifth runs, the figures were fol- 
lowed by their appropriate names (non- 
sense syllables of matched medium-associa- 
tion value). On the second, fourth, and 
sixth runs, the figures appeared in a different 
order, and the names did not appear. Sub- 
jects were asked to write the appropriate 
name of each shape as it was presented for 
10 seconds. The scores used in a mixed 


4 The authors are grateful to Florence Aye 
and her staff at the Dr. Howard Elementary 
School, Champaign, Illinois, for their support 
and assistance. 
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Fic. 2. Mean number of correct re- 
sponses in classification of random shapes as 


related to age, stimulus variability, and 
trials. 


analysis were the number of correct names 
produced by each subject for each level of 
variability. For all groups, 5-turn figures 
were named BIP, 10-turn figures were named 
QOR, 20-turn figures were nai 


med ZAL, and 
40-turn figures were named TUJ. A response 
was scored as correct if the subject gave two 
of the three letters in the syllable in Proper 
order. Two additional groups of subjects 
(22 fourth graders and 22 third graders) 
were run with different relations of non- 
sense syllables and figures. The results of 
this control series were the same as those 
described below. 


Results 


There was a consistent monotonic 
relation between age and accuracy of 
class naming (Fanos = 304; p< 
.001). There were also Statistically 
significant effects of trials—Trial 1 
versus Trial 3 (Pig = 27.1 p< 
.001) and stimulus variability (Fray = 
24.5; p « .001). Of greatest interest, 
the predicted interaction among age 
level, stimulus variability, and trials 
shown in Figure 2 was Statistically 
stable (Fi, = 2.31; p < .01). 

The results shown in Figure 2 dem- 
onstrate that younger children have 
differentially greater difficulty learning 
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to associate names to classes of figures 
of many turns and, further, that 
younger children profit relatively less 
from experience. This effect is shown 
best by the differential achievement of 
correct names for the class of 40-turn 
figures. The second- and third-grade 
children have not learned the correct 
name after Trial 1, and the second- 


and 
third-grade children still have not 
learned the name of the class of many- 


turn figures after Trial 3. The fifth- 
and sixth-grade subjects respond with 
many more correct names to the 40. 
turn figures on Trial 3 as compared to 
their performance on Trial 1, while 
adults show relatively high attainment 
of correct solutions on both Trial 1 and 
Trial 3, 


Study III 


A Second study of categorization was 
designed to test, in a somewhat dif- 
ferent way, the assumption that 


o all 
iron- 


younger subj 
show differentially higher 
for shapes of high v 


ects would 
error rates 
ariability.5 

Stimulus Materials, Photographs of 32 
different random Shapes were printed on 5 
X7 inch glossy Paper. Two prints each of 
eight 5-turn figures, eight 10-turn figures 
eight 20-turn figures, and eight 40-turn fie. 
ures were used. g- 

Subjects. Thirty-seven st 
ductory psychology class 
of Illinois, and 37 third 
grade pupils served as s 


udents in intro. 
es at the University 


-grade and 37 sixth- 
ubjects,o 


ë The data for this Study were collected by 
Marvin Daehler during his tenure as an 
NSF Undergraduate Research Student, 


: "a authors are grateful to Floyd Daub 
and his staff at Milled eville Ele; 
School for thei: ind E 


T Support and assistance, 
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* Procedure. Subjects were given the stack 
of 64 photographs in random order and 
the following instructions : “Here is a set 
of pictures. Sort them into four groups 
which you think seem to g0 together. Make 
each of the piles approximately equal in 
size. Don't take too long looking at the 
pictures, just look at one and put it into 
one of the piles. You may look at several 
pictures before beginning to sort them into 
piles. Any questions? OK, you may begin.’ 


Results 


Percentage of err 
for all subjects for t 
variability of shapes. 
response bias t 
each category was N—/N 
number of shapes appropriate to that 
category which were placed outside it 
(N—) divided by the total number 
of shapes, whether appropriate OF not, 
placed in the category (Nr)- The 
simpler measure of number of correct 
placements in each category produced 
essentially the same results. Figure 3 
presents the results of Study IH. The 
ordinate represents a reciprocal form 
of the error measure in order to make 
the results shown comparable to the 

The 


results obtained in Study Il. 


younger the subject, the more difficulty 


he has in correctly categorizing figures 
of many turns. This interaction be- 
tween age and stimulus variability 1s 
significant (Foss: = 337; p< 0D- 


The effect of age 15 uw. ve 
= 10:33 < 001), as is the eec o 
pe : 437; P 


stimulus variability (Fajssa = 
« 001): 

An interesting secondar ; 
sulted from an analysis of the direc- 
tion of miscategorization. When only 
shapes erroneously categorized were 
considered, shapes rated high in mean- 
ingíulness were placed in the category 
below the correct one, and shapes 
rated low in meaningíulness were 
placed in the category above the Ts 
rect one (67 ; p< 05). T 
finding co our earlier ob- 


or was calculated 
he four levels of 

To correct for 
ed for 


; that is, 


y finding re- 


nforms with 
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servation that meaningfulness reduces 
the apparent variability of random 
shapes (Munsinger & Kessen, 1964). 

The results of these two studies of 
categorization are congruent with the 
notion that young children do not at- 
tend to all the variability presented in 
figures of many turns. In both stud- 
jes there was a reliable interaction be- 
tween age and level of stimulus varia- 
bility with almost all of the interaction 
variance contributed by age variation 
in the categorization of 20- and 40- 
turn figures. Thus, regardless of the 
overall shape of the function, there 
seems to be a clear difference across 
age in the ability to classify figures of 
high variability. However, neither our 
argument nor the data in support of 
it are conclusive; the preference of 
children for high-variability figures re- 
mains somewhat puzzling. 

The studies of categorization, while 
they help us to understand develop- 
mental changes in the structuring of 
variability, introduce additional prob- 


RECIPROCAL PERCENTAGE ERRORS 


5 10 20 40 
NUMBER OF INDEPENDENT TURNS 


Fic. 3. Reciprocal percentage errors made 
by subjects of different ages in categorizing 
figures of different numbers of independent 


turns. 
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he theoretical position we 

i len, How is it that adults are 
able to categorize 40-turn figures so 
well, in the light of our earlier con- 
clusions that 40-turn figures sioni 
be difficult for adults to structure? 
This formation of the question leads 
to a clarification of what is meant by 
the structuring of variability. . The 
ability of an adult to differentiate a 
particular figure of high variability 
from others or even his ability to 
classify figures of high variability does 
not entail his understanding of the 
figures, that is, his ability to make 
sense of them in terms of his present 
cognitive structure. To take a differ- 
ent example, one can recognize the dif- 
ferences between classical mechanics 
and relativity without being able to 
structure relativity. It is for this reason 
that the implications of the categoriza- 
tion studies are equivocal. The fact 
that adults are able to categorize fig- 
ures of high variability does not imply 
that they can Structure them. On the 
other hand, the inability of young chil- 
dren to categorize figures of high vari- 
ability lends Support to, but does not 
demonstrate, the conclusion that young 
children respond to figures of high 
variability by sampling only part of 
the presented variability, 


EXPERIENCE AND STRUCT URE 


Perhaps the most interesting prob- 
lem for a theory of cognitive structure 
is the specification of a mechanism 
of cognitive change. As we have 
seen, a number of commentators have 
pointed toward the importance of stim- 
ulation which is just beyond the per- 
son's present ability to structure. This 
position concerning cognitive change is 
Worked out in greatest detail in the 
Observations Piaget made on his chil- 
dren during their first 3 years of 
life. Having specified a domain for 


cognition — stimulus variability — and 
having found systematic variation of 
human response in this domain, it is 
possible for us to state hypotheses 
about the conditions for cognitive 
change. Let us say that a person 
expresses a preference for figures of 
intermediate variability, "This expres- 
sion of preference indicates that inter- 
mediate variability is just beyond his 
Present ability to structure. If we 
want to shift preference by subjecting 
him to a range of experience that will 
be optimal for cognitive change, it 
would appear that (a) stimulation 
within his present level of preference 
would have little or no effect on his 
preference structure ; (b) stimulation 
preference level 
€ effect on his 
ecause such stim- 
d his present ca- 
and (c) stimu- 
just beyond his 
€rence should be 
changing cogni- 


Study IV 


In Study TV, second and third grad- 
ers, fifth and sixth graders, and adults 
were given restricted experience with 
four levels of stimulus variability, 
Change in expressed Preference for 
random Shapes was used as 


s a measure 
of the effect of experience with varia- 
bility. As a further test of the ef 


tested either immediately a 
ence with one of the leve 
bility, or 24 hours later. 

was designed, therefore, to 
role of restricted experience 
ability in producing cognitiy 


fter experi- 
Is of varia- 
This study 
assess the 
with vari- 
'€ change, 


"m 
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and the variation of these effects over 
age and time. 


. Subjects. A total of 480 subjects served 
in this experiment. Three hundred and 
twenty elementary school children from 
Champaign, Illinois, schools * and 160 col- 
lege students from the University of Illinois 
undergraduate psychology subject pool 
served as subjects. One hundred and sixty 
second- and third-grade pupils were in the 
“young” age group, and 160 fifth- and sixth- 
grade pupils were in the “intermediate” age 
group. Both groups had equal numbers of 
boys and girls and equal numbers of sub- 
jects from each component grade. 

Stimulus Materials, Five sets of eight 
random shapes were constructed using a 
procedure developed by Attneave and Arnoult 
(1956) and described by Munsinger and 
Kessen (1964). 

One set of eight random shapes con- 
tained figures of 5, 6, 8, 10, 13, 20, 31, and 
40 independent turns. This set of shapes 
was called the Test Set. Each of the other 
four sets contained eight different shapes 
of the same number of turns. Thus, the 5- 
turn set was made up of eight different 
figures of 5 independent turns, the 10-turn 
set of eight 10-turn figures, the 20-turn set 
of eight 20-turn figures, and the 40-turn set 
of eight 40-turn figures. 

The stimuli were projected in pairs to the 
front wall of the room in which the subjects 
were seated. Subjects were instructed to 
state their preferences for each pair of 
stimuli. 

Procedure. Eight groups of 20 subjects 
were used at each of the three age levels. 
Subjects in each age group were first asked 
to state their preferences for the Test Set 
of random shapes presented in an optimal 
order (Ross, 1934) according to a paired- 
comparison procedure. This allowed an 
estimation of a base level of preference for all 
the groups. Immediately following the pres- 
entation of Test Set for judgment, two ran- 
domly selected groups of subjects at each age 
were presented four successive “an 
paired-comparison preference runs (11 
:adgments) of S-turn random shapes. Two 
Juce ndomly selected groups, immediately 
opua leting judgments of Test Set, 
after completing J -e aiima 
were presented four successive pa rei 
parison preference runs with the 


E e Mal- 
= are grateful to Ann 
I : Tie ae at ‘Bottenfield Elementary 
School, Champaign, Illinois, for their sup- 


port and assistance. 
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ments "of Test Sel c ee 

; 2 t, were presented four 
successive paired-comparison runs with 20- 
turn shapes. The remaining two groups, im- 
gri -— completing judgments .of 

5 , cre presented four successive 

paired-comparison preference runs with the 
40-turn random shapes. 
: Thus each group within each age level 
judged a Test Set of varied random shapes 
(28 pairs of stimuli) and then made 112 
judgments of pairs of 5-turn, 10-turn, 20- 
turn, or 40-turn random shapes. Thé three 
age groups were run at different times. 
For each age level, following the repeated 
exposure to one of the four different levels of 
variability experience, one of the two groups 
from each experience level was asked to 
leave the room and to return again the next 
day. The four remaining groups of 20 sub- 
jects at each age level immediately made 
paired-comparison judgments of the same 
Test Set seen previously. The other four 
groups of 20 subjects returned 24 hours later 
and made paired-comparison judgments of 
the same Test Set of random shapes. 

In summary a total of 24 groups was 
used. Four groups at each of three age 
levels were tested for change in preference 
for variability immediately after experience 
with either 5-, 10-, 20-, or 40-turn random 
shapes. Four other groups at each of three 
age levels were given comparable experiences 
with levels of variability, but were tested 24 
hours later. 

Preference data were analyzed by conver- 
sion of pairwise preference to normalized 
deviates and then by taking least-squares 
solutions for the complete matrix of the re- 
sulting deviates as was done in earlier studies 
(Munsinger & Kessen, 1964). 

Preference scores for the Test Set repli- 
cated the results of earlier studies in showing 
high preference for an intermediate level of 
variability (10 turns). Because our interest 
was in change of preference, difference scores 
were used in the analysis. A mixed-design 
analysis of variance was performed on the 
difference scores between judgments made 
during the first presentation of Test Set and 
judgments made during the second presenta- 
tion of Test Set. The response measure 
used to compute the difference score was the 
number of "votes for" each random shape in 
Test Set by each subject. To ensure un- 
constrained row variance, votes for only 
the 5-, 10-, 20-, and 40-turn shapes of the 
Test Set were included in the analysis. 
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Results 
i5 IMMEDIATE TEST 


It should be noted that figures of 
varying numbers of turns are used in 
two distinct ways—as experimental 
manipulation (Experience) and as re- 
Sponse measure (Measure). Figures 
4 and 5 show the interaction among 
age, experience with variability, and 
time of testing summed over the four 
measures, A between-groups analysis 
of variance shows this multiple inter- 
action to be highly significant (Fase 
= 1025 p< 001). As can be seen, 
neither adults nor young children 
markedly change their preference for 
variability immediately after the ex- Ir. 
0 20 a5 perimental manipulation of experience 
EXPERIENCE (Figure 4); the Children of inter- 
NUMBER OF INDEPENDENT TURNS mediate age change substantially, par- 
Fic. 4. The change in preference for ticularly in their response to experi- 
random shapes of subjects at three age ence with 5- and 10-turn figures. 


levels immediately after experience with Twenty-four hours after experience 
levels of variability. 


uw 


orES FOR ALL MEASURES 
o 


' 
uw 


(TEST SET - TEST SET\] 


2. 
o 


MEAN CHANGE IN Vi 
uw 


5 


99 5-TURN TEST 
g 24-HOUR TEST Y 
2 
3 3 ad t 
< 2 
E] L 
= z , 
m = (03) 
E pn E] di 
xü tu Adult 
g M n’ 
RE x 5 Lo 
a O w 
ae č E 
E wv i 
E Wu s 
gu Bs 
zu su -*Gr5«6 
=v e ds " 
Wk zou 
ou E » uw 
Gime ge 
ó -.5 
[») t | 
Z 5 U | 
z 
d - 
= E 1.0] 
= 
$ 10 20 40 
EXPERIENCE iS 
NUMBER OF INDEPENDENT TURNS EXPERIENCE 
Fic. 5 The change in preference for NUMBER or INDEPENDENT TURNS 
random shapes of subjects of three age levels Fic. 6 Cites 
i i . 6. e 
24 hours after experience with levels of of 5 turns of shee ee for figures 
variability, 


S at three age levels 


after experience with levels of variability, 


STIMULUS VARIABILITY 


IO-TURN TEST 


- TURN FIGURE 
-TEST SET,) 


MEAN CHANGE IN VOTES FOR 10 
(TEST SET> 
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EXPERIENCE 


NUMBER OF INDEPENDENT TURNS 


Fic. 7. Change in preference for figures of 
10 turns of subjects at three age levels after 
experience with levels of variability. 


with variability (Figure 5), children 
of intermediate age show relatively lit- 
tle effect of the manipulation, young 
children shift somewhat, and adults 
change their expressed preference dra- 
matically. 

The effects of experience with vari- 
ability can be further understood by 
a consideration of the changes in ex- 
pressed preference for the different 
measures at the three age levels. 
Changes in preference for the four 
measures are plotted separately in Fig- 
ures 6, 7, 8, and 9. Statistical evalua- 
tion of the measures requires within- 
subjects comparison. The interaction 
of Measures X Age X Experience X 
Time reflected relatively well the time 
difference shown in Figures 4 and 5, 
but did not reach statistical signifi- 
cance. The interaction shown in Fig- 
ures 6, 7, 8, and 9—Age X Experience 
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X Measures—was significant RETA 
= 225; p « .01). 

Subjects of all ages showed relatively 
little differential change in their ex- 
pressed preference for figures of 10 
and 20 turns (Figures 7 and 8). In 
general 10-turn figures were liked 
Somewhat better in the second presen- 
tation of the Test Set, and 20-turn fig- 
ures somewhat less. The interesting 
differences, both across experience and 
across age, were in response to the 
5-turn shapes (low variability), shown 
in Figure 6, and to the 40-turn shapes 
(high variability), shown in Figure 9. 
In Figure 6 (response to 5-turn fig- 
ures as measure) the young and inter- 
mediate-age children show much the 
same pattern of shifts in preference. 
Experience with 5-turn figures in- 
creased their expressed preference for 
5-turn figures. The other experience 
conditions produced little effect among 
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Fic. 8. Change in preference for figures of 
20 turns of subjects at three age levels after 
experience with levels of variability, 
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Fic. 9. Change in preference for figures 
of 40 turns of subjects at three age levels 
after experience with levels of variability. 


the children. Adults disliked 5-turn 
figures after experience with them, but 
somewhat strangely turned to the low- 
variability figures after 40-turn experi- 
ence. This shift represents largely an 
immediate reaction of the adults; 24 
hours after 40-turn experience the 
adults do not show an increased pref- 
erence for 5-turn figures, 

In Figure 9 (response to 40-turn 
figures as measure) the similarity of 
changes in expressed preference is be- 
tween the young children and adults. 
Both of these groups shift in prefer- 
ence for high variability after experi- 
ence with high variability. In addi- 
tion, the young children also show 
a preference for high variability fol- 
lowing experience with both 10- and 
20-turn figures. In contrast, the in- 
termediate-age children shift their ex- 
pressed preference away from high 
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variability under all conditions except 
10-turn experience. : 

The results of Study IV show sys- 
tematic effects of differential experi- 
ence with variability on preference 
judgments, but the changes in be- 
havior cannot be interpreted under the 
simple postulate about cognitive change 
proposed earlier. A closer examina- 
tion of Figures 6 and 9 will indicate 
where our theoretical presuppositions 
make sense and where they are in need 
of adjustment. The behavior of adults 
in response to 40-turn test stimuli is 
congruent with expectation; experi- 
ence with high variability (20- and 
40-turn figures) results in an increased 
ability to process variability and a 
corresponding shift in preference, For 
young children, a parallel performance 
must be given a different interpreta- 
tion. Taking these Tesponses of sec- 
ond and third graders together with 
the results of Studies II and III, one 
may conclude that experience with 10- 
20-, and 40-turn figures gives young 
children an Opportunity to apply their 
sampling-of-variability tactic. They 
find a variety of processable samples 


in figures of high variabilit 


r y, and the: 
like them. The fifth and sixth pti 


ers, assumed to attend to all of the 
presented variability, find experience 
with 20- and 40-turn figures little help 
in processing high variability; after 
exposure to variability well aboye their 
assumed processing limit, no change 
in preference occurs, Only the group 
of fifth and sixth graders given ex- 
perience with 10-turn shapes—a de- 


The relative simplicity of the fore- 
gomg account is inadequate to com- 
prehend the response of the groups of 
different age and different experience 


i 
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to figures of 5 turns. It will be re- 
membered that no substantial change 
in preference for figures of low varia- 
bility was predicted for any group of 
subjects. Figure 6, on the contrary, 
shows that the largest obtained dis- 
persion among age groups was in re- 
sponse to 5-turn figures. Why should 
experience with 5-turn figures lead to 
an increase in preference for similar 
figures among children and a decrease 
among adults? Equally puzzling, why 
should experience with high variability 
(40-turn figures) lead to an increase 
in preference for 5-turn figures among 
adults and no change or a decrease 
among children? To the first ques- 
tion, one may reasonably answer that 
5-turn figures are so much below the 
processing limits of adults that ex- 
posure to 224 instances of low varia- 
bility surfeits and bores them. It is 
less obvious to conclude that children 
who have looked at so many low-vari- 
ability figures still find enough un- 
certainty in 5-turn shapes to interest 
them. The second question requires 
an explanation for the paradoxical 
increase in adult preference for 5- 
turn figures after 40-turn experience. 
Clearly, this shift is not merely a 
turning away from an overload of vari- 
ability ; these same subjects show an 
increased preference for 40-turn fig- 
ures as well (Figure 9). But it should 
be noted again that the increased pref- 
erence for 5-turn figures shown by 
adults immediately after experience 
with 40-turn figures does not appear 
24 hours later and that the preference 
for 40-turn figures is greater 24 hours 
after experience with similar. figures 
of high variability than it is A oen 
ately after exposure. The influenc 


iati these phe- 

of temporal variation on 
nomena Of changed preference for 
s further 


stimulus variabi 
empirical analysis. 


N 


SUMMARY AND DISCUSSION 


A study of the ability of subjects of 
three age levels to estimate the amount 
of stimulus variability contained in 
random figures ( (Study I) supported 
the postulation of a limit on the proc- 
essing ability of human beings. Al- 
though variation with age was shown, 
in general all subjects showed decreas- 
ing estimation accuracy for figures 
varying from 5-10 independent turns 
and a very low but unchanging esti- 
mation accuracy for figures of more 
than 10 turns. This finding accords 
well with data obtained earlier on pref- 
erence for random shapes (Munsinger 
& Kessen, 1964). 

Although a postulate of limited proc- 
essing ability comprehends the data on 
estimation accuracy and preferences 
for figures of between 5 and 10 inde- 
pendent turns, it is inadequate for an 
explanation of the preference response 
to figures of more than 10 turns. We 
have shown in previous research that 
young children prefer figures of high 
variability more than do adults. Stud- 
ies II and III of the present report 
were designed to examine the sugges- 
tion that this apparently paradoxical 
finding was due to the tendency of 
young children to attend to only part 
of the variability presented in random 
figures of many turns. The predic- 
tion, tested in Study II, that young 
children would have differential diffi- 
culty in attaching class names to fig- 
ures of high variability was confirmed. 
The prediction, tested in Study II, 
that young children would have dif- 
ferential difficulty in sorting figures of 
high variability appropriately was also 
confirmed. Taking these results to- 
gether with earlier findings (Mun- 
singer et al., 1964), it is our tentative 
conclusion that young children prefer 
high variability, and thus appear to 


be able to process high variabitity. 
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largely because they select from figures 
of many turns that amount of varia- 
bility which they can handle. 
A study of the effect of four levels 
of experience with variability on the 
preference behavior of subjects of 
three ages (Study IV) was designed 
to test the assumption that change in 
cognitive structure (and consequent 
change in preference) would be most 
likely when subjects were presented 
with levels of variability just beyond 
their ability to process. The results 
obtained were congruent with this pos- 
tulate only in part. The change in 
preference response to figures of high 
variability was generally as predicted; 
the change in response to figures of 
intermediate and especially to figures 
of low variability indicated that re- 
visions of the simple assumption about 
the nature of cognitive change would 
be necessary. Specifically, the results 
of Study IV suggested inter alia that 
(a) children remain responsive to the 
few-turn figures even though their ex- 
pressed preference for these figures is 
initially low; and (b) the response of 
adults (and probably of children) to 
experience with variability is depend- 
ent in significant part on the temporal 
relations of experience and test of 
preference. 
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TOWARD A THEORY OF BEHAVIORAL 
CONTAGION* 


LADD WHEELER? 
Naval Medical Rescarch Institute, Bethesda, Maryland 


Behavioral contagion is defined operationally and is contrasted with 


conformity, imitation, social pressures, and social facilitation. 


Experi- 


ments dealing with contagion are reviewed and theoretical statements 


derived from this review. 


The basic theoretical argument is that the 


lowering of the avoidance gradient in an approach-avoidance conflict is 
essential to the occurrence of contagion. 


Redi (1949), the social scientist 
most closely associated with the con- 
cept of contagion, lists four points as 
necessary for its occurrence: 


1. Existence of an acute conflict area 


within the imitator; strong impulse 


urge toward fulfillment of a vehement 
need on one hand, sufficient pressure 
from ego or superego forces to keep it 
down on the other. 

2. High degree of lability of the imi- 
tator's "personality balance" in the area 
concerned; impulses strong enough to 
press for release, controls only just 
strong enough to prevent that release. 

3. Existence of a similar type of 
strong urge toward impulse expres- 
sion along the same line in the initi- 
ator; his urge must lie in the same di- 
rection as that of his imitator. 

4. Open acting out in favor of im- 


. pulse satisfaction by the initiator, with 


1From Bureau of Medicine and Surgery, 
Navy Department, Research Task MR005.12- 
2005.01, Shbtask 1. The opinions and state- 
ments contained herein are the private ones 
of the writer and are not to be construed as 
official or reflecting the views of the Navy 
Department or the Naval Service at large. 

2] thank-Irving Altman and Albert Ban- 
dura for their comments on earlier versions 
of this paper. An extensive correspondence 
with J. Barnard Gilmore and a careful study 
of his (1965) paper have been most valuable. 
His paper may be obtained from him at the 


University of Waterloo. 


equally open display of an entire lack 
of fear or guilt. 


Combining these four points does not 
provide a definition of contagion, as the 
points more nearly constitute a theory 
than a definition. When Redl and his 
associates began systematic research on 
contagion, they defined the concept as 
follows: 


An incident of behavioral contagion, for the 
purposes of the present investigation be- 
comes, then, an event in which a recipient's 
behavior has changed to become "more like" 
that of the actor or initiator. This change 
has occurred in a social interaction in which 
the actor has not communicated intent to 
evoke such a change [Polansky, Lippitt, & 
Redl, 1950, p. 322]. 


Unfortunately, the above definition 
does not adequately distinguish con- 
tagion from other types of social influ- 
ence. For example, the Asch situa- 
tion fits the definition. 

Behavioral contagion ® will be de- 
fined in this paper through the use of 
reduction sentences (Carnap, 1936, 


3 The term “behavioral contagion” is used 
advisedly. Many psychologists, because of 
certain aspects in the development of the 
concept of contagion, associate it with the 


spread of an emotional climate. While emo- 


tions do engender like emotions under certain 
circumstances, this paper will be concerned 
with overt behavior which may or may not 
follow or be accompanied by the spread of 


emotions. 
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1937). One advantage in using reduc- 
tion sentences to form a definition is 
that the definition is then “open. 
Other reduction sentences may be 
added to the definition as knowledge 
of the phenomenon increases. One re- 
duction sentence will be used at this 
time to define contagion. It is labeled 
below as S (1). T" 

S (1). If the set of test conditions 
T, exists, then contagion has occurred 
if and only if Person X performs Be- 
havior N (By) where T, is specified 
as follows: (a) A set of operations 
has been performed on Person X which 
is known to produce instigation toward 
By in members of the class to which 
X belongs; (b) By exists in the re- 
Sponse repertoire of X, and there are 
no physical restraints or barriers to 
Prevent the performance of By; (c) X 
is not performing By; (d) X observes 
the performance of By by Person Y4 

S (1) defines contagion completely 
in terms of observable events and con- 
ditions. It is necessary to the next 
section of this paper to draw certain in- 
ferences about the Processes underly- 
ing S (1). If Person X is instigated 
to By, is not physically restrained from 
performing By, and is not performing 
By, the implication is that X possesses 
internal restraints against the perform- 
ance of By. Thus, there is an intra- 
personal conflict between the instiga- 
tion to perform By and the internal re- 
straints against performance of By. In 
other words, X is experiencing an ap- 
proach-avoidance conflict. S (1) also 
implies that X’s observation of the 
performance of By by Person Y 


4By is not to be taken as a completely 
specific behavior, but as a behavior class. 
For example, aggression is a class of be- 
havior, while the verbal statement, hei 
idiot !,” is simply one member ot hat £ i 
Thus, if the model says, You idiot !, p: 
the observer says, "I'll poke you n : 
nose!,” both statements are to be considere 
as Bx. 


changes the relative strengths of the 
approach tendency and the avoidance 
tendency. The implication is supplied 
by the fact that X changes his behavior 
after observing the behavior of Person 
Y. The discussion of what changes 
take place in the approach and avoid- 
ance tendencies will be deferred until 
the theoretical section of the paper. 
An example of the general situation 
described in S (1) might be appropri- 
ate at this point. Every Sunday an in- 
dividual, X, is frustrated in his quest 
for sleep by the sound of the power 
mower of the neighbor across the street, 
Z. Because of the strong restraints 
against directing aggression toward a 
neighbor in our culture, S has confined 
himself to politely requesting every 
Sunday that Z wait until afternoon to 
mow his lawn, One Sunday, as X 
is approaching Z to make this request, 
another neighbor, Y, also approaches 
Y curses Z and threatens him with 
physical destruction, Without even 
Waiting for Z's reaction to this attack, 
X also becomes quite aggressive toward 
In the remainder of this paper, Per- 
son X of both the definition S(1) and 
the example above Will be referred to 
as the observer. Person y of S (1) 


and the above example will be referred 
to as the model, 


RELATIONSHIP BETWEEN CONTAGION 
AND OTHER TYPES or Socran 
INFLUENCE 

One may 
havioral co 
psychologis 


ask why the concept of be- 
ntagion is necessary when 
ts already have conformity, 
imitation, social facilitation, and so- 
cial pressures (not to mention half a 
dozen other related terms). Let us 
briefly examine some of these terms. 
The term “conformity” has been 
used so indiscriminately that one must 
be somewhat arbitrary in defining it, 
The mass of recent research on con- 


ye 
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formity has been concerned with the 
Asch effect or modifications thereof. 
Consequently, the definition of con- 
formity presented here will be of Asch- 
type conformity. Such a definition 
might be written as follows: If Line ab 
is sufficiently longer than Line cd so 
that Person X has in the past normally 
given the correct judgment of relative 
line length when making this judg- 
ment in the absence of other indi- 
viduals, and if Person X is asked to 
make this judgment immediately after 
and in the presence of three or more 
other individuals who have audibly 
stated that Line cd is longer than Line 
ab, then Person X is conforming if and 
only if he states that Line cd is longer 
than Line ab. 

This definition could, of course, be 
written in somewhat more general 
terms.  Asch-type conformity is not 
restricted to the judgment of line 
length and has, of course, been ex- 
tended into opinions. But regardless 
of what is being judged, it is apparent 
from the above definition that conflict 
is produced in Person X by the judg- 
ments of other individuals. The con- 
flict is between making the subjectively 
veridical but socially rejected judgment 
and making the subjectively counter- 
factual but socially accepted judgment. 
According to the definition, Person X 
must resolve this conflict by himself. 
That is, the actions of the other indi- 
viduals create the conflict and do not 
aid in resolution of the conflict. 

On the other hand, the reduction 
sentence, S (1), used to define con- 
tagion implies that conflict in Person 
X exists Prior to the appearance of 
other individuals and that the presence 
of other individuals contributes to con- 
flict resolution. "Thus, there is a defi- 
nite and major difference between con- 
formity and contagion. 

An interesting example of the differ- 
ence between conformity and contagion 


may be seen in the effect of a nonunani- 
mous majority in the Asch situation. 
Prior to the appearance of a partner 
who gives the veridical judgment, the 
subject (S) is instigated to give the 
veridical judgment, but experiences in- 
ternal restraints against doing so be- 
cause the unanimous majority has given 
a different judgment. S resolves this 
conflict created by the majority by 
giving the nonveridical judgment; the 
socially created restraints against giv- 
ing the veridical judgment are too 
great. A partner who then begins ta 
give the veridical judgment reduces S’s 
restraints, which is to say that con- 
tagion occurs, and S ceases to conform 
to majority opinion. Thus, we have a 
well-documented situation in which 
contagion is not only directly opposed 
to conformity, but appears to be the 
more powerful of the two phenomena. 

At this point in the paper, it has 
been argued that while contagion and 
conformity both involve conflict, they 
differ sharply in the role other indi- 
viduals play in the conflict. A third 
term which has been of great impor- 
tance in research on social influence is 
"pressures toward uniformity" (Fest- 
inger, 1954). What is the relation of 
this term to contagion and conformity ? 

In essentials, there is no difference 
between conformity and pressures 
toward uniformity. In both cases, the 
individual possesses an opinion or be- 
lief which he then finds to be in conflict 
with the opinions or beliefs of others. 
In both cases, the individual must re- 
solve this conflict. Any appearance of 
differences between the two types of 
social influence is due to the greater 
elegance of the pressures toward uni- 
formity formulation and to different 
research techniques. That is, experi- 
ments conducted to investigate pres- 
sures toward uniformity have typically 
involved face-to-face discussions of 
opinions in which pressures toward 
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uniformity may be evidenced by the 
individual changing his own opinion 
(the usual evidence for conformity), 
by the individual changing the opinions 
of others (impossible in an Asch-type 
situation), or by the individual reject- 
ing the others as comparison persons 
(possibly and likely in an Asch-type 
situation, but the relevant data are 
never collected). . 
Another type of social influence is 
"social facilitation.” ^ According to 
'Thorpe (1956, p. 120), social facilita- 
tion is said to occur when "the per- 
formance of a more or less instinctive 
pattern of behavior by one member of 
a species will tend to act as a releaser 
for the same behavior in another or in 
others, and so initiate the same lines 
of action in the whole group." Thorpe 
is an ethnologist and would probably 
forgive students of human behavior for 
insisting that the behavior need not be 
instinctive. Bandura and Walters 
(1963, p. 79) provide the following il- 
lustration of social facilitation (or in 
their terms, of the eliciting effect of a 
model) : 


An obvious eliciting effect may be observed 
in cases in which an adult, who has lost the 
idioms and pronunciations of the local dialect 
of the district in which he was raised, returns 
for a visit to his home. The original speech 
and pronunciation patterns, which would take 
a stranger years to acquire, may be quickly 
reinstated. 


One may easily demonstrate social 
facilitation by reaching for a cigarette 
during a conversation. Other smokers 
will also light cigarettes, 

Social facilitation was nicely demon- 
strated by Starch (1911) by having Ss 
copy in longhand literary passages 
written in longhand. Although Ss 
were not instructed to do so, they imi- 
tated the longhand style in which the 
passages were written. Starch called 
this phenomenon “unintentional or un- 
conscious imitation." 


In such situations there is no ap- 
parent conflict. "There would appear to 
be no definite restraints and no particu- 
larly noticeable instigation. By what 
process then does social facilitation oc- 
cur? There are several candidate 
answers which are probably all ac- 
curate to some extent. 

In the first place, it is probably true 
that merely conceiving of a behavior 
increases its probability of occurrence 
and is usually the first step in perform- 
ing that behavior. If there are no re- 
Straints present, observing and thus 
conceiving of a behavior is likely to 
lead step by step to performance, We 
may call this cognitive-behavioral 
chaining. 

In the second place, there is probably 
no behavior in one's repertoire which is 
entirely without restraints. If there are 
no other restraints, mere inertia con- 
stitutes a restraint. Observing some- 
one else perform a behavior reduces 
these minor restraints; after all, it did 
not seem to hurt the other fellow. 

Third, there are many behaviors 
which are conditioned Tesponses to the 
conditioned stimulus of another person 
performing the behavior, Thus, we 
frequently yawn in the Presence of 
others who are yawning because all 
members in the group have approxi- 
mately the same need for oxygen. After 
this has happened on a number of oc- 
casions, one’s personal yawn may be 
elicited solely by the sight of another 
person yawning. 

In short, social facil 
ated by several factor. 
in combination. It is distinguished 
from conformity, cont 


agion, and social 
preme by lack of any marked con- 
ict. 


What of the by now almost generic 
term “imitation”? The work of Miller 
and Dollard (1941) is, of course, the 
Cassic in the field of imitation. The 
primary argument in their book is that 


itation is medi- 
S, separately or 


E punishment and that imitation then same; the model is the same; the ef- 
generalizes across drives, models, and fective responses are the same), any 
situations. These authors describe imitation learned in Situation Al will 
two ways of learning imitation: manifest itself in Situation A2. 
matched-dependent behavior and copy- It is apparent that if one wishes to 
mg define imitation as I have attempted to 

| Th: both: cases; thie resone are cxondcted define other types of social influence, 

f to a cue from an independent subject or OPE could not better the definition of 
model. In both, likewise, the punishment of English and English (1958), “action 
ees ie qe responses and the reward of that copies the action of another more 
formit Rp e tee Imm Nw er less exactly, with or without intent 
the follower. The essential difference be. tO COPY [p. 253]." Thus, imitation 
tween the two processes is that in matched- may be considered a generic term sub- 
et Sie He imitator responds suming contagion, conformity and so- 

m. éabyinede cheese cis - — pod es cial pressures, and social facilitation. 

"a and difference produced by stimulation from In order to make the relationships 
his own and the model's responses [p. 159].  4Inong the several uniformity processes 

clearer to the reader, Table 1 describes 
After learning to imitate, it presum- social pressures and conformity, con- 
ably makes no difference whether learn- — tagion, and social facilitation in terms 
ing took place via matched-dependent of initial conditions for the observer, 
. behavior or copying. Following es- the model’s behavior, the hypothetical 
tablished principles of stimulus and processes occurring in the observer, 
response generalization, learned imita- and finally the observer's behavior. 
tion will have more or less of an effect Table 1 performs the function of dis- 
: in different situations. tinguishing between the various social 
j Stated somewhat differently, to the influence processes. Another neces- 
J| extent that Situation A2 is similar to sary scientific function is to synthesize 
T the learning situation, A1 (i.e. the ob- phenomena, and this function is at- 
i server has the same drive; the discrimi- tempted in Figure T. 
V TABLE 1 
Dynamics OF SELECTED INFLUENCE PROCESSES 
Pog Sagen i ifin Behavioral contagion Social pranua Soçial facilitation 
2 Observer's initial Instigated to By. In- | Instigated to Bp. No | No restraints against 


jos 
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imitation is learned via reward and 


native and background stimuli are the 


conditions 


ternal restraints 
against Bx. 


restraints. 


By or Bp. No instiga- 
tion to By or Br. 


Model's behavior 


Model performs Bx. 


Model performs Bx. 


Hypothetical 
processes 


Reduction of model's 
restraints against Bx. 
Fear reduction. 


Creation of restraints 
against Bp. Conflict 
between By and Bp. 


Model performs By. 

| e Ae ure 
Cognitive-behavioral 
chaining, CS elicits 
CR, inertia overcome. 


Observer's behavior 


Observer performs 
Bx. 


Observer performs Bx 
(or rejects model or 
induces model to per- 
form Bp). 


Observer performs By. 
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CONFORMITY 


Corona Y 


Fic. l. Three-dimensional model of con- 
formity, contagion, and social facilitation. 
(The model performs Bx; B» is any and all 
behavior or behaviors other than Bx. The 
strength of forces in the conflict is the sum 
of the strength of forces in the observer 
toward Bx and toward Br. Combined 
strength of forces ranges from 0.0 to C, 
where C is the maximum possible in any 
conflict. The ratio of By to B» forces in 
the observer is directly related to the proba- 
bility that Bx will be performed. The case 
in which the ratio of Bx to B» forces is 
greater than 1.0 is not relevant to the 
discussion.) 


The front of the block in Figure 1 is 
concerned with the combined strength 
of forces in the observer's conflict both 
prior to and after the performance by 
the model of Behavior N (Bx). Note 
that for any initial level of forces in the 
conflict, the forces in contagion must 
be to some degree less after the be- 
havior of the model, and in conformity 
they must be to some degree greater 
after the behavior of the model. This 
may be verified by drawing anywhere 
on the face of Figure 1 a line parallel 
to the ordinate. Let the intersection 
of this line with the abcissa be +, and 
let the distance along the abcissa from 

x to the zero point be the x intercept. 
Now draw anywhere on the face of 
Figure 1 a line parallel to the abcissa. 
Let the intersection of this line with 
the ordinate be y, and let the distance 


along the ordinate from y to the zero 
point be the y intercept. If the two 
lines so constructed intersect in the 
contagion area, the x intercept will be 
greater than the y intercept. If the 
two lines intersect in the conformity 
area, the x intercept will be less than 
the y intercept. 

If the two constructed lines intersect 
in the social facilitation area, the x 
intercept may be greater than, less 
than, or the same as the y intercept. 
This is consistent with the earlier rep- 
resentation of social facilitation as being 
mediated by some combination of re- 
straint reduction, elicitation of a con- 
ditioned response, and cognitive-be- 
havioral chaining. Restraint reduction 
would, of course, reduce the combined 
strength of forces, while elicitation of 
a conditioned response and cognitive- 
behavioral chaining would act as addi- 
tional forces and would thus increase 
the combined strength of forces, Fur- 
thermore, social facilitation is shown to 
involve small forces both before and 
after the behavior of the model, Social 
facilitation overlaps with both conform- 
ity and contagion, thus describing the 
twilight zone in which these latter two 
phenomena may be indistinguishable. 

The third dimension of the block in 
Figure 1 represents the ratio of Bx 
forces (Bx is the behavior performed 
by the model) to Bp forces (Bp is 
any other behavior) prior to the 
model’s performance of By. It is indi- 
cated that for contagion the By forces 
are slightly less than the Bp forces, 
and we know from the face of the block 
that the sum of the By and Bp forces 
is relatively large prior to the behavior 
of the model. On the other hand, the 
forces involved in social facilitation 
are approximately equal. and it is 
shown on the face of the block that the 
sum of these forces is small; that is 
there is high lability, but no marked 
conflict. In the case of conformity 


met 
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the Bp forces are greater than the By 
forces, but the ratio varies with the 
type of situation. In the easy discrimi- 
nation kind of situation originally used 
by Asch (1956), Bp forces are far 
greater than By forces. When opinions 
are at issue (cf. Festinger, 1954), the 
ratio of Bx to Bp forces is somewhat 
greater, and the ratio is even greater in 
the  autokinetic situation (Sherif, 
1936). This distinction of various con- 
formity situations is, of course, con- 
sistent with the well-known fact that 
conformity varies directly with the 
ambiguity of the stimuli to be judged. 
Tt should be noted that Figure 1 does 
not contain the situation in which the 
ratio of By to Bp forces is greater than 
1.0 prior to the model’s performance of 
By. This is because such a ratio 
would lead the observer to perform By 
independent of the model’s perform- 


ance of By. There would be no social 
influence, 


CONTAGION : EMPIRICAL REVIEW AND 
THEORETICAL IMPLICATIONS 


Little research relevant to behavioral 
contagion has been conducted. What 
research that has been done will be or- 
ganized according to the major ques- 
tion(s) to which the research con- 
tributed. After research relevant to 
any one question has been described, 
theoretical implications of the research 
will be discussed. The central theoreti- 
cal statement to the following discus- 
sion is that behavioral contagion as de- 
fined in S (1) is mediated by the low- 
ering of the observer’s avoidance gradi- 
ent in an approach-avoidance conflict. 


Specificity of Response Matching 


In an experiment by Grosser, Polan- 
sky, and Lippitt (1951), Ss were told 
that certain unspecified toys in a col- 
lection of toys were forbidden. In this 
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way restraints against playing with any 
of the toys were created. Ss played 
with the toys more when they observed 
an experimental confederate playing 
with the toys than when they did not. 
In addition, Ss tended to play with toys 
they had initially rated as most at- 
tractive, regardless of which toys the 
confederate played with. The investi- 
gators attributed the successful con- 
tagion to a change in the overall per- 
ceived safety of the situation. 

Similar results with older Ss were 
obtained by Wheeler, Smith, and Mur- 
phy (1964). Army recruits were 
placed in a waiting room containing 
magazines, darts, and shuffleboard. 
They were told that they could read 
magazines while they waited. Thus, 
mild restraints against playing with the 
games were created. Ss played with 
the games more frequently when they 
observed an experimental confederate 
playing with one of the games than 
when the confederate read or simply 
walked about the room or when the 
confederate was not in the room. This 
was true whether or not the quantity 
of game equipment allowed S to play 
the same game as the confederate. 
Thus, contagion was demonstrated un- 
der conditions that made specific re- 
sponse matching impossible. 

These experiments suggest that de- 
pending upon the initial strength of the 
approach and avoidance gradients, the 
observer's behavior may or may not be 
an exact imitation of the model’s be- 
havior. In a situation involving sev- 
eral potential actions directed toward 
the same end or expressing the same 
need pattern, observation of the model's 
performance of Behavior A will reduce 
the observer's fear of performing A! as 
well as A. Depending then upon the 
relative strengths of the approach gra- 
dients for A and A!, the observer may 
perform A! as a consequence of wit- 
nessing the performance of A. 
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Balancing of Approach and Avoidance 
Tendencies 
- Mand Blake (1958) created 
wee of thirst in Ss. With an 
experimental confederate, Ss were 
placed in the vicinity of a water foun- 
tain with a sign which read: “Do not 
use this fountain.” The measure of 
contagion in each of the two thirst con- 
ditions was the difference in 
centage of Ss who drank if 
federate did not drink and the percent- 
age who drank if the confederate did 
drink. Contagion was greater in the 
Weak Thirst condition because most 
Ss in the Strong Thirst condition drank 
regardless of what the confederate did. 
Freed, Chandler, Blake, and Mouton 
(1955) obtained somewhat similar re- 
sults. Ss approaching the main door 
of a university building encountered a 
sign urging use of an inconvenient side 
door. In three separate conditions, the 
sign implied three different degrees of 
prohibition. An experimental con- 
federate either did or did not conform 
to the injunction in each of the condi- 
tions. The measure of contagion in 
each condition was the difference be- 
tween the percentage of Ss obeying the 
sign when the confederate conformed 
and the Percentage obeying the sign 
when’ the confederate deviated. This 
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straints are far stronger than the ap- 
proach tendency, massive fear reduction 


is necessary to produce a change in 
behavior. If the avoidance gradient is 


below the approach gradient, the be- 
havior will be performed whether the 
model is present or not. 


Consequences to the Model of his Be- 
havior 


Bandura, Ross, and Ross (1963b) 
exposed children to a television pro- 
gram depicting an aggressive model. 
In one condition, the model was re- 
warded for his aggression; that is, the 
model obtained good things for being 
aggressive. In a second condition, the 
model was thrashed for this aggression. 
After viewing the program, children 
were placed in a room containing nu- 
merous toys that could be used to re- 
produce the model's aggressive re- 
Sponse patterns. Children who had ob- 
served the rewarded model weri 
aggressive than children who I 
served the punished model, 

Walters, Leat, and Mezej 
placed children.in the Vicinity o 
tractive assortment of toys with 
they were forbidde 
children were then s 
other child playing w 
of the children saw 
by his mother for pla 
Others saw the child rewarded through 
his mother's nurturant interaction. 
After viewing the film, the children 
were left alone in the room containing 
the toys. Children who had seen the 
model rewarded played with the toys 


more frequently than children who had 
seen the model punished, 


t is apparent] 


had ob- 


(1963) 


f an at- 


n to play. The 
hown a film of an- 


ith the toys. Some 
the child rebuked 
ying with the toys. 


Y not necessary for the 
Observer to know the consequences © 

the model’s behavior, however. Band-, 
ura, Ross, and Ross (1963a), Walters 
and Thomas (1963), and Walters, 
Thomas, and Acker (1962) found Ss 


to be more aggressive after watching 


€ more , 
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af aggressive model who had not been 
obviously rewarded or punished for 
aggression. It is probably the case that 
if the observer expects the model to be 
punished, the lack of any overt punish- 
ment is perceived as rewarding, 

Walters and Parke (1964) and 
Walters, Parke, and Cane (1965) 
found that children violated, prohibi- 
tions against playing with toys less af- 
ter having observed a punished model 
than after having observed a rewarded 
model or a model who was neither re- 
Warded nor punished. 

An implication of these studies is 
that the observer vicariously performs 
the behavior performed by the model, 
and the observer vicariously experi- 
ences the consequences of the behavior 


“which are overtly experienced by the 


` model. 
, model is rewarded or not punished, the 


observer's avoidance gradient is low- 
` ered. 


To the extent then that the 


Characteristics of Effective Models 


Polansky et al. (1950) and Lippitt, 
Polansky, and Rosen (1952) found 
that children high in prestige were the 
most effective models for contagion. 
However, many of the incidents of 
contagion tabulated and analyzed did 
not involve restraints, and thus the 
conclusions from the two studies are 
not as relevant to the current discus- 
sion as they might be. A field experi- 
ment on the contagion of aggressive 
behavior, done within the larger study 
by Polansky et al. (1950), found ef- 
fective models to be highly impulsive. 


This may mean simply that impulsive 
boys acted sooner than others and thus 


had more opportunity to become 


models. 

Lefkowitz, Blake, and Mouton 
(1955) investigated the contagion of 
pedestrians crossing against a red 
light, Contagion was greater when 
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the initiator was a well- i 
status individual — M i p 
a n he was 
poorly dressed. 
a rs pese ae regard to the status 
model su 
characteristics of cipe toe prem 
the same effect as ; ia e n ive 
immediate conse. 
quences of the model's behavior m 
the observer perceives the model en- 
gaging in behavior which has no i. 
mediate visible consequences, the ob 
server may estimate the likelihood of 
the model's subsequent punishment or 
reward. One basis for such an esti 
mate is the previous success of the 
model in obtaining rewards and avoid. 
ing punishments. Thus, to the extent 
that the model appears to be or is 
known to be generally rewarded or not 
punished for whatever behavior he 
emits, the observer's avoidance gradi- 
ent is lowered for whatever behavior 
the model performs. 


Characteristics of the Observer 


Polansky et al. (1950) and Lippitt 
et al. (1952) found that observers most 
susceptible to influence through con- 
tagion had high prestige. The field 
experiment on the contagion of ag- 
gression, also reported by Polansky 
et al. (1950), indicated that impulsive 
children were most likely to be influ- 
enced toward aggression through the 
observation of aggressive models. 
These findings duplicated exactly those 
concerning effective models. 

Smith, Murphy, and Wheeler (1964) 
found no relationship between con- 
forming in an Asch-type situation and 
being influenced by a model to violate 
a prohibition against playing with 
games available in a waiting room, 
Moreover, while the usual relationship 
between conformity and authoritarian- 
ism was obtained, there was no rela- 
tionship between contagion and au- 


thoritarianism. 
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There is evidence that susceptibility 
to contagion is increased by mea or 
extreme physiological states. , Nova- 
kovsky (1924) linked starvation, fa- 
tigue, illness, and extreme cold to sus- 
ceptibility to modeling influences. A 
number of the incidents mentioned in- 
volved behavior that possessed poten- 
tially negative consequences. Schach- 
ter and Singer (1962) found that an 
adrenalin injection increased the con- 

tagion of anger, particularly if S could 

not explain his unusual physiological 
state as a product of the injection. 

Bandura et al. (1963b) investigated 
sex differences in observers in an ex- 
periment on the contagion of aggres- 
sion. Ss watched either a televised 
nonaggressive model, a model punished 
for his aggression, or a model rewarded 

for his aggression. After observing the 

adult male models, the children were 
placed in a playroom, and aggressive 
behavior was observed. Compared to 

a control condition in which no model 

was observed, total aggression ex- 

pressed by the females was lowered by 
the nonaggressive model and not af- 
fected by the other models. In con- 
trast, the total amount of aggression 
produced by the males was decreased in 
the model-punished condition and in- 
creased in both the model-rewarded 

condition and the condition in which a 

nonaggressive model was observed. 

These results were explained in a 

reasonable although post hoc manner, 

and it is not necessary to repeat the 
explanation here. 

The evidence relevant to character- 
istics of the observer having been re- 
viewed, no general statement tying the 

5 Other studies have shown the sex of 
the observer to influence contagion. Such 
results are important to the present discus- 
sion in pointing out that socialization creates 
restraints and dispositions that affect sus- 
ceptibility to contagion in different ways 
depending upon what behavior is involved 
in the contagion. 


experiments together is readily ap- 
parent. We will let the review stand 
on its own merits. The only point that 
might be stressed is that the kind of 
person most susceptible to influence 
through contagion does not appear to 
be anxious, authoritarian, or of low 
self-esteem. 


Locus of Restraints 


It has been argued in this paper that 
contagion is mediated by restraint re- 
duction or fear reduction. Thus, it 
may be useful in making predictions 
concerning contagion to recognize that 
the locus of restraints may be different 
from one situation to another. As the 
locus of restraints changes, what is 
feared should change. In turn, situ- 
ational determinants of contagion may 
change. While there has been no re- 
search directed toward this possibility, 
certain predictions present themselves. 

If the restraints are group derivéd, 
the observer will fear rejection by the 
group for his lack of impulse control. 
The most effective model should be a 
solid supporter of the group norms, an 
individual who is quite unlikely to be 
rejected. If this individual performs 
the forbidden action, he is in essence 
declaring that the group no longer for- 
bids the action, thus reducing the ob- 
server's fear. However, if there is too 
much of a status differential between 
the model and the observer, the ob- 
server may feel that the model is usin 
"idiosyncrasy credits” (Hollander, 
1958) which he himself does not pos- 
sess. In such a situation, fear would not 
be reduced, and contagion would not 
occur. To the extent that the observer 
1s attracted to the group and does not 
have access to other desirable groups, 
fear of rejection should be strong, and 
Contagion should be less likely to occur, 
To the extent that other group mem- 
hers express disapproval of the model’s 
action, contagion will not occur, This 
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effect will be most striking when dis- 
approval is expressed by high-status 
group members. In general, when the 
restraints are group derived and a 
group member performs the forbidden 
act, the behavior should spread quickly 
throughout the group. Every member 
that. performs the act reduces the re- 
straints further. 

In a different situation, the locus of 
restraints is the observer's superego. 
The observer fears guilt and regret for 
engaging in the behavior. Thus, the 
model’s action must somehow reduce 
the anticipation of guilt by the observer 
which is possible by making the action 
seem more “right” and less “wrong.” 
For example, Bandura and Walters 
(1959) have suggested that adolescents 
are more likely to engage in sexual in- 
tercourse during double dating than at 
other times. When there are superego 
restraints, the apparent morality of the 
model should be of great importance. 
An adolescent girl is not likely to have 
her restraints against sexual behavior 
lowered if the model is either a prosti- 
tute or a nun. The prostitute model 
could not reduce the observer’s fear of 
guilt because of the model’s apparent 
lack of sexual morality. The nun 
model would be rejected as a false nun. 
The most effective model should have 
slightly higher apparent sexual moral- 
ity than the observer. Another predic- 
tion is that to the extent that the model 
performs the behavior without overt 
guilt or fear responses, the observer's 
fear should be reduced. 

Ina still different situation, the locus 
of restraints is an authority figure. The 
optimal conditions for eliciting con- 
tagion in such a situation are complex 
As pointed out earlier, the most ef- 
fective model is often one who generally 
appears to escape punishment for his 
behavior. But the model who fits this 
description perfectly is often not an 
effective model because he is not sub- 
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ject to the authoritarian restrictions in 
the first place. Thus, a speeding police 
car is not likely to reduce restraints 
against breaking the speed limit. 

The most effective model for reduc- 
ing authoritarian restraints would be 
one who performed the behavior in the 
presence of the authority without pun- 
ishment by the authority. In addition, 
the model must originally appear to be 
subject to the same authoritarian re- 
striction as the observer. 

An experiment by Milgram (1965) 
provides an example of contagion when 
the locus of restraints is an authority 
figure. Ss were instigated to cease 
shocking another individual, but they 
were ordered by an authority to con- 
tinue the shocking. Two models who 
were apparently under the same orders 
from the authority refused to continue 
the experiment and were not punished 
by the authority for their disobedience. 
Virtually all Ss ceased to obey the au- 
thority. On the other hand, models 
who obeyed the authority had no effect 
at all, as one would predict from the 
theory of contagion. 

Unlike the situation in which the lo- 
cus of restraints is the superego, simi- 
larity between the observer and model 
is probably not relevant. This is because 
the observer fears punishment from an 
external source, and any model who 
performs the behavior without nega- 
tive consequences will reduce re- 
straints, no matter how dissimilar he 
may be to the observer. 

There is no intention to imply that 
the discussion of group-derived re- 
straints, superego restraints, and au- 
thoritarian restraints is exhaustive; 
there are undoubtedly other common 
loci of restraints. Nor is there any 
intention to imply that the usual situ- 
ation has only one locus of restraints. 
The intention is merely to suggest the 
potential predictive value of specifying 
the locus of restraints. 
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Interpersonal Consequences of Con- 
tagion 
Festinger, Pepitone, and Newcomb 
(1952) hypothesized that when mem- 
bers of a group are deindividuated, in- 
ternal restraints are reduced, and pre- 
viously suppressed behaviors are per- 
formed. This in turn leads to satisfac- 
tion with the group. Essentially the 
same hypothesis was suggested by 
Hoffer (1951). Some support for this 
hypothesis was provided by Festinger 
et al. (1952) who found (a) a negative 
correlation between the frequency of 
negative statements about parents and 
the ability to remember which person 
in the group said what, and (5) a posi- 
tive correlation between the frequency 
of negative statements about parents 
and the desire to have further discus- 
sions with the other group members. 
The first correlation was interpreted 
as a positive relationship between re- 
straint reduction and deindividuation, 
while the second correlation was inter- 
preted as a positive relation between re- 
straint reduction and attraction to the 
group. 

Singer, Brush, and Lublin (1964) 
found that in groups of girls discussing 
pornographic literature, members were 
more attracted to groups in which ob- 
scene words were spoken. This was 
interpreted as a positive relation be- 
tween restraint reduction and attraction 
to the group. 

Deindividuation is a feeling of ano- 
nymity, and one would expect it to 
exist to the extent that the individual 
cannot be singled out as the only per- 
son performing a behavior. Thus, ob- 
servation of a model performing a be- 

havior should produce in the observer 
some feeling of anonymity with regard 
to his own performance of that be- 
havior. If the model appears to be 
“getting away with it,” the overall re- 
sult should be a lowering of the ob- 


server’s restraints against performing 
the behavior, the restraints being 
lowered both by the feeling of ano- 
nymity and by the knowledge that the 
behavior will not be punished. Now it 
is probably true that whenever re- 
straints are reduced, by whatever 
means, to the point of behavior oc- 
curring and going unpunished, there 
will be satisfaction with the situation. 
One type of satisfaction with the situ- 
ation would be “liking” for the model 
and a knowledge that he and the model 
have something in common. It has 
been shown in a series of studies that 
when an individual perceives some at- 
tribute in common between himself and 
a model, he will tend to perceive him- 
self as having other of the model’s at- 
tributes (Burnstein, Stotland, & Zan- 
der, 1961; Stotland & Dunn, 1962, 
1963; Stotland & Hillmer, 1962; Stot- 
land & Patchen, 1961; Stotland, 
Zander, & Natsoulas, 1961). From 
these considerations, we may predict 
that after the occurrence of contagion in 
a situation in which the observer is 
not punished for his behavior, the ob- 
server will like the model more than 
previously and will feel more similar 
to him on a variety of dimensions, that 
is, he will be less individuated from the 
model. i 

Note that while in the Festinger et 
al. (1952) formulation deindividuation 
leads to restrain reduction, the con- 
tagion formulation suggests that most 
of the deindividuation occurs after the 
restraint reduction. 


CoxcLuDING REMARKS 


This paper has focused upon ex- 
amples of contagion which involve soci- 
ally undesirable behavior, such as ag- 
gression, adolescent sexual intercourse, 
and violation of formal prohibitions. 

But it is not necessary that contagion 
be confined to socially undesirable be- 
havior. Experiments have shown that 
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volunteering behavior is increased by 
the observation of others volunteering 
(Rosenbaum & Blake, 1955; Schachter 
& Hall, 1952). The latter authors 
specifically attributed the effect of vol- 
unteering models to the reduction of 
group-derived restraints. 

The incident in the Kew Gardens 
section of New York in which a num- 
ber of people observed a woman at- 
tacked three times and eventually 
murdered could probably have been 
prevented by one individual. Had any 
one individual made a visible attempt 
to aid the victim, the fear would 
probably have been reduced in other 
of the observers, who would have then 
acted. 

This paper has dealt with socially 
undesirable behavior because when the 
behavior elicited through contagion is 
socially desirable, it is sometimes dif- 
ficult to rule out other explanations. 
For example, there are probably strong 
pressures upon an individual to volun- 
teer when those around him are volun- 
teering. However, it should be em- 
phasized that theoretically contagion 
is not limited to socially undesirable 
behavior. The theory is concerned 
with all behavior which the observer 
perceives as possibly bringing negative 
consequences to the performer. 
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PSYCHOLOGICAL REVIEW 


SHORT-TERM VERBAL MEMORY AND LEARNING: 


LLOYD R. PETERSON 


Indiana University 


A number of paired-associate experiments are examined for evidence 
= of the interaction of a short-term storage with a long-term learning 
EM mechanism. Differential effects depending on length of the retention 

"^ interval are described for duration of the spacing interval between 2 
presentations of individual pairs, duration of presentations, and number 
of repetitions. The rapidly decreasing initial portion of the retention 
curve with multiple paired associates is considered to be the result of 
interference and the passage of time. The later portion of the short- 
term retention curve is considered to largely reflect an associative 
mechanism which may produce stronger associations with time. A 
model of the interaction of these mechanisms with a guessing 


component is presented. 


Hebb (1949) has written a detailed 
account of a possible relationship be- 
tween short-term memory and learn- 
ing. He hypothesized that recall for a 
brief period of time following stimula- 
tion was made possible by the rever- 
beration of neural circuits, and that this 
reverberation also participated in the 
formation of a structural memory trace. 
A single trial by itself was not consid- 
ered to leave a lasting aftereffect. 

F More recently, research in short- 
erm verbal retention has provided data 
from which inferences have been made 
which appear to support Hebb's con- 
cept of a transient activity trace. 
Brown (1958) obtained results after a 
1The investigations 
been supported E National ‘oo e 
tion Grant GB 704. The paper was written 
at the Institute of Human Learning, Univer- 
sity of California, Berkeley. The author is 
indebted to N. H. Anderson and E. Tulving 
for criticisms of portions of drafts of the 


Paper. 


single presentation which he interpreted 
in terms of a rapidly decaying trace. 
He considers that decay is rapid re- 
gardless of whether or not interfering 
activities occur. Repetition does not 
strengthen the trace, according to this 
position, but only puts off the onset of 
the decay. Broadbent (1963) suggests 
that it is possible that decay is charac- 
teristic of traces which have been re- 
peated several times, as in learning 
over trials. He holds that short-term 
memory deteriorates rapidly because of 
the time that interfering activities take, 
rather than due to the similarity of the 
activity to the material being tested. 
In this respect he considers short-term 
memory to differ from long-term mem- 
ory, in which retention has been shown 
to be related to similarity of the inter- 
fering activity to the learning. . 
In contrast to this point of view 
there is a growing trend for many in- 
vestigators to consider that short-term 
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retention involves processes similar to 
those involved in long-term retention. 
For instance, Peterson and Peterson 
(1959) have suggested that a brief 
presentation of three unrelated letters 
can be conceptualized as a trial of 
learning. They tested recall after in- 
tervals up to 18 seconds during which 
the subject engaged in counting activ- 
ity. Latencies were often long, and 
their probabilistic nature resembled 
that of recall in a learning experiment, 
Further, repetition improved recall in 
an orderly fashion. Again, Hebb 
(1961) has modified his position as a 
result of a verbal memory experiment. 
He gave subjects a succession of im- 
mediate-memory tests in which every 
third sequence of digits was ordered 
identically. There was an improve- 
ment with repetition even though dif- 
ferent orders of the same digits were 
being tested between repetitions. This 
suggested to Hebb that something more 
permanent than an activity trace re- 
sulted from a single presentation, Mel- 
ton (1963) elaborated Hebb’s experi- 
ment, concluding that a structural trace 
was produced by a single presentation, 
and that short-term memory operates 
by essentially the same mechanisms as 
does long-term memory. He consid- 
ered that the effects of various kinds of 
interference in producing short-term 
forgetting, as well as the effect of repe- 
titions in improving short-term reten- 
tion, are further evidence of the con- 
tinuity of short- and long-term reten- 
tion, 
The present paper outlines a position 
which accepts the Presence of learning 
mechanisms in short- as well as long- 
term retention, but which also provides 
for a recency mechanism which inter- 
acts with the learning mechanisms. It 
is suggested that these Processes work 
concurrently, and that the effective role 
of an individual process varies with the 
recency of the event to he remembered, 


as well as being dependent on various 
situational variables, Learning is ini- 
tiated immediately on one presentation, 
but its presence shortly after presenta- 
tion is largely masked by the operation 
of a recency mechanism, Memory for 
recent events is largely effected by this 
short-term mechanism, and memory 
drops off rapidly when events of the 
same class succeed each other, Mem- 
ory for events in the more distant past 
is made possible by learning mecha- 
nisms, and it need not decrease over 
the course of a short interval, but may 
increase. The relative contribution of 
each mechanism normally varies as a 
function of the length of the retention 
interval and the nature of the events 
occurring during the interval. Evi- 
dence for the manner of operation of 
these mechanisms wil] be sought in var- 
iation of the composition and duration 
of intervals Separating various events 
In tests of retention, 

Most of the experiments which will 
be considered deal with paired associ- 
ates. A Paired-associate task would 
seem to be ideal for the investigation 
of relationships between short-term 
memory and learning. It has been 
studied extensively through the acqui- 
sition of lists of pairs and by tests of 
retention after intervals of time meas- 
ured in minutes, hours, and days, 
More recently, studies have been re- 
ported in which retention of paired as- 
Sociates has been tested over intervals 
measured in seconds from the presenta- 
tion of an individual pair to the cue for 
its recall, 


age interval between pa 


ge ir irings to be in- 
dicative of learning, in 


contrast to re. 
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tention which is studied over longer 


intervals. To the extent that situations 
can be arranged so that short-term re- 
tention studies are similar to list studies 
in important respects, performance at 
the asymptote of the short-term reten- 
tion curve can be considered to reflect 
the same phenomenon that is reflected 
by anticipations over trials, 


INTERFERENCE 


_There is considerable evidence to in- 
dicate that interference from other 
activities is a major source of forgetting 
in short-term tests. However, the 
means by which its effects are pro- 
duced require some consideration. In- 
terference comes in various forms, 
depending on the composition and 
timing of the activities involved. 
> Retroactive interference of the class- 
ical type in which a new response is 
paired with an old stimulus has been 
found in the short-term retention of 
paired associates (Peterson, Saltzman, 
Hillner, & Land, 1962). The standard 
condition consisted of tests of a single 
pairing after a 10-second interval filled 
with presentations and tests of other 
pairs. The retroaction condition dif- 
fered only by the pairing of the cue 
word with a new response midway 
through the retention interval. Recall 
of the initial pairing was correct in 
27 of the tests in the standard con- 
dition, while it was only .14 in the 
retroaction condition. 

Evidence for proactive interference 
of a complementary variety has been 
reported by Friedman and Gelfand 
(1964). They successively paired two 
different responses to two stimuli 
having a common element. Presenta- 
tions and tests of other pairs filled the 
intervals between pairings and between 
the last pairing and the test for reten- 
tion. Recall of the second pairing was 
slightly lower than recall in a control 
condition in which only one response 
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Time (sec) 
Fic. 1. Recall on a second test in relation 


to recall on a first test. (Cı indicates the 
curve for correct recalls on Test 1. EEs 
indicates the dependent probabilities that the 
same incorrect response occurred on Test 2 
as on Test 1. E;C: indicates the dependent 
probabilities that the correct response oc- 
curred on Test 2, given an error on Test 1.) 


had been paired with the common 
stimulus element. The proaction and 
control conditions were tested equally 
often at various stages of the session, 
and the interpretation is clearer than in 
cases where decrement over stages of a 
session is explained in terms of pro- 
active interference. 

Data of relevance to the interaction 
of competing responses to the same cue 
are provided by an experiment in which 
two successive tests followed an initial 
presentation of a pair (Peterson, Saltz- 
man, Hillner, & Land, 1962). The 
tests did not provide knowledge of 
results. The pairs were combinations 
of words and numbers, and the pre- 
sentation or test of other pairs filled 
intervals. The data graphed in Fig- 
ure l are from conditions in which 
either 4 or 8 seconds elapsed between 
presentation and first test, while an- 
other 4 or 8 seconds elapsed between 
first and second tests. Correct recall 
on the first test was found to drop off 
rapidly. Recall on the second test, 
given an error on the first test, de- 
pended on both the retention interval 
before the first test and the interval 
from first to second test. Recall was 
correct in .12 of the cases when the 
second test followed the first by 4 
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n this interval 
eph a ed in both con- 
diens was significantly higher than 
the chance level for 15 alternatives 
(t = 2.31, p < 05; t = 3.87, p < .001). 
In the case of pairs not recalled 8 sec- 
onds after presentation, recall did not 
go appreciably above the chance level 
on the second test. 

There was a suggestion of an overall 
tendency for recall to improve as time 
increased since the error on the first 
test (F = 4.58, df = 1/35, p < .05). 
It seems possible that responses which 
were given incorrectly on the first test 
may have provided an interference 
effect that decreased with time. Evi- 
dence for this interpretation is provided 
by the conditional probability that an 
incorrect response on the first test 
would be followed by the same incor- 
rect response on the second test. These 
conditional probabilities decreased over 
the interval following the first test, 

and the decrease was about the same 
for both intervals between presentation 
and first test. The forgetting curves 
for these errors resemble that of the 
correct response, suggesting a short- 
term storage equally efficient for pair- 
ings read correctly from the drum as 
for pairings spoken in error, It seems 
plausible that an erroneous response 
made to a cue competed with later 
correct recall to that same cue. As the 
erroneous pairing was forgotten, 
call of the correct response recover 
The interaction of the initially paired 
response with the erroneously paired 
response may provide a Paradigm for 
competition generally. The last oc- 
curring event is very strong initially, 
but it decreases with time to become 
approximately equal in strength to the 
earlier event, as both approach the 
asymptotes of their retention curves, 


re- 
ed. 


2 Tke values for these conditions listed in 
the original paper are incorrect. 


There is a relative increase in strength 
of the earlier pairing as time passes. 
Since pairings which were incorrect 
on the first test were later correct in 
substantial numbers on the second test, 
the results are evidence against any 
simple all-or-none interpretation of 
short-term forgetting. A pairing is 
not forgotten once and for all, but it 
may become available later. Of course, 
if two mechanisms are involved in 
short-term retention, then one could 
lose information gradually, while the 
other had all-or-none characteristics. 
The short-term mechanism could in- 
volve gradual cessation of activity, 
while the learning mechanism produced 
abrupt changes. There is some sug- 
gestion that this might be the case in 
the finding of slower recovery when an 
8-second interval elap: 
tation to test. 
vals tested rea 
pected by gue: 
It is not poss 
curve whether 
be expected at 


sed from presen- 
Recovery over the inter- 
ched only the level ex- 
ssing in this condition. 
ible to judge from the 
further recovery might 
longer intervals. 


NUMBER or Parrs 


Pairs not having members in com- 
mon with the pair under test can also 
be shown to provide competition at 
recall, The decrease in recall that 
comes with an increase in the number 
of pairs recently presented is such an 
instance. The limiting comparison of 
differing numbers of pairs arises when 
retention of one pair is compared with 
retention of two pairs, Very little 
forgetting of a single pair of words 
has been found over intervals up to 
16 seconds filled with counting activity, 
while marked forgetting has been found 
with two pairs (Peterson & Peterson, 
1962) ether this effect of one 
ir is due to a difference in 


associative learning is 


open to question. With only one pair 
there can be no test of a discrimination 
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between cues. The subject responds 
with the only response that was pre- 
sented, and recall in this case is for 
an individual item. In the case of 
retention of two pairs, it must be noted 
that there was a primacy effect. The 
recall of the first pair was better than 
recall of the second pair, time since 
presentation being equal. Since con- 
siderations of retroactive and proactive 
interference would not suggest an in- 
equality in this direction, another factor 
or factors would appear to be involved. 
A greater disparity in stimulus con- 
ditions from presentation to test in the 
case of the second pair, when the 
stimulus situation is considered to ex- 
tend over recent events, is one con- 
sideration. Another is the likelihood 
that learning of the second pair is 
lessened by reason of the previous pair 
having occurred, an effect which may 
be referred to as associative inter- 
ference (Goggin, 1964). 

Comparisons involving from two to 
five pairs of words have been investi- 
gated by Murdock (1963). Recall for 
one pair of a sequence was tested im- 
mediately after the last pair had been 
presented. Under these circumstances 
recall was very high in the case of two 
pairs, and the biggest difference oc- 
curred when the number of pairs was 
increased from two to three. The 
higher recall in this case for two pairs 
as contrasted with that found in the 
ip ment of Peterson and Peterson 
que to stem from the fact that 

urdock tested immediately after the 
presentation of the second pair. There 
is evidence ios the last pair of a se- 
quence can be recalled i i 
close to 100% of the see qaid 
subject can tell the experimenter what 
both members of the pair were without 
cuing. If, then, two pairs have been 
presented and the subject is tested im- 
mediately by being given one member 
Of the first pair as cue, it seems un- 


likely that any genuine discrimination 
is required to give the other member of 
the pair. The subject has the last pair 
immediately available, and he sees the 
third of the four words. The fourth 
can be given by elimination. 

The main interest in Murdock's 
(1963) experiment lies in the compari- 
sons of three, four, and five pairs. Re- 
tention curves were plotted in terms of 
how many other pairs intervened be- 
tween presentation of an individual pair 
and its test. Thus, the case of two 
intervening pairs could be compared 
for all three lengths of sequence. How- 
ever, the presentation of the pair tested 
would be the first in a sequence of 
three pairs, while it would be the third 
in a sequence of five pairs. The slopes 
of the retention curves over one or 
two intervening pairs were similar for 
three, four, or five pairs in the se- 
quence. Retention dropped off rapidly. 
The chief differences lay in the asymp- 
totes reached, the level being lower the 
greater the total number of pairs. 

It may be hypothesized that the 
major effect of the competition at re- 
call that varies with number of pairs is 
on the guessing probability. With 
more responses available the possibility 
of guessing correctly is diminished. 
This point of view is similar to that 
put forward in connection with list 
learning with differing numbers of al- 
ternatives and length of list held con- 
stant. The equation for a learning 
curve has been fitted to data for one 
number of alternatives, and perform- 
ance has been successfully predicted for 
a list with another number of alterna- 
tives without any change in the learn- 
ing parameter. This was accomplished 
by taking as the guessing estimate the 
reciprocal of the number of alterna- 
tives (Bower, 1962; Estes, 1961). 

Recall failure that occurs due to 
an be thought of as a de- 


competition c hought 
ity to discriminate among 


creased capac 
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response alternatives on one basis or 
another. If the last presentation is to 
be recalled in the absence of cues, as in 
the recall of individual items, competi- 
tion of other recently presented items 
should be strong, since the discrimina- 
tion depends on recency alone. Thus, 
with successive tests performance de- 
teriorates (Keppel & Underwood, 
1962). When a basis for discrimina- 
tion other than recency is provided, as 
when a number is tested after a series 
of tests with letters, recall of the item 
of a new class is as good as recall on 
the first test of a session (Wickens, 
Born, & Allen, 1963). In the case of 
responses for which there are unique 
cues, competition should be at a mini- 
mum, for to the extent that cue-re- 
sponse contingencies have actually been 
established, competition should be ab- 
sent. This contrasts with the case 
where more than one response has been 
associated with the same cue. Here the 
cue-response contingency contributes to 
the competition. In the light of these 
considerations competition of responses 
with unique cues can be said to affect 
the guessing probability, that is, the 
subject’s ability to produce the correct 
response apart from the influence of 
the cue. 


REMINISCENCE 


The activity filling the retention in- 
terval in most of the studies mentioned 
previously has consisted of the presen- 
tation of materials of the same class as 
that testing recall. When intervening 
activity is used consisting of material of 
a categorically different kind in a non- 
learning task, marked forgetting over 
short intervals is also found. However, 
the same type of study has also found 

evidence of reminiscence with longer 
durations of this same activity (Peter- 
son, 1966). A list of five pairs of 
words was presented once, and recall 
was tested by a random sequence of 
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Fic. 2. Recall of a list of five pairs as 
a function of duration of intervening digit- 
reading activity. 


first members of each pair after from 
0-24 seconds. The intervening activ- 
ity from end of presentation to begin- 
ning of test consisted of the reading of 
random sequences of digits from a 
memory drum, The retention curves 
for three of the positions in presenta- 
tion averaged over all positions in the 
test sequence are shown in Figure 2, 
In the case of the fifth position in 
presentation the digit reading resulted 
in forgetting over the early portion of 
the retention curve, This could be at- 
tributed to interference from earlier 
pairs, since proactive interference is 
considered to increase with time. How- 
ever, such an interpretation would pre- 
dict. forgetting generaly rather than 
reminiscence, A decreasing and then 
increasing type of retention curve ma 
be inferred for the first and third posi- 
tions in presentation. These start 
lower in the figure than the fifth posi- 
tion, since the early decreasing portion 
of the retention function can be as- 
sumed to have already occurred during 
the presentation of the last pairs in the 
list. A decreasing and then increasing 
curve is also found if the last pair in 
presentation is observed only on the 
first test of the sequence. Thus, a non- 
learning activity using categorically dif- 
ferent materials was found to produce 
forgetting over short intervals and 
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' reminiscence over longer intervals. It 
.would not be expected that recall 
would continue to improve indefinitely. 
An experiment has been reported which 
found forgetting to increase after there 
had earlier been reminiscence (Keppel 
& Underwood, 1961). 

The rapid forgetting of the last pair 
could be described as due to proactive 
interference. This would follow from 
data previously described showing that 
without the presentation of the previous 
pairs there would be little forgetting. 
Somehow earlier pairs interfere with 
later pairs, as time passes since the last 
pairs were presented. The question of 
why the interference increases with 
time requires an answer. Let us as- 
sume that a recency mechanism loses 
Stored information during any inter- 
vening activity that prevents rehearsal. 
As the subject becomes unable to an- 
Swer on the basis of the short-term 
store, competition among recently pre- 
sented responses enters in to influence 
his response. If no previous pairs other 
than the tested pair had been presented, 
the subject could respond correctly 
simply by giving the only available re- 
sponse. The total number of responses 
determines the subject's response be- 
cause of the failure of the recency 
mechanism. i 

Such an explanation by itself fails to 
account for the reminiscence. Another 
mechanism would seem to be required, 
an associative learning mechanism 
which increases in effectiveness with 
time, This. may be conceptualized as 
a consolidation of the association. The 
recall test may be considered to be a 
disrupting influence which is less severe 
for a consolidated association. 

An alternative explanation in terms 
of differential forgetting might be pos- 
sible. The availability of Very recent 
presentations may diminish with time 
permitting the cue to have greater 
Weight in determining the subject’s re- 
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sponses. There is seen to be a short- 
term mechanism as well as an associa- 
tive learning mechanism required in 
this explanation, also. 


Tue Rote or TIME 


Competition has been noted to play 
a strong role in forgetting over short 
intervals, and the question of inter- 
ference or decay does not permit an 
answer solely in terms of decay. The 
term decay will be taken to mean for- 
getting which would occur no matter 
how dissimilar preceding and interven- 
ing activities were to the tested material. 
There are a number of experimental 
results which are difficult for an inter- 
ference theory to explain. For ex- 
ample, Conrad (1964) has presented 
letters visually and found confusions in 
recall which are systematically related 
to acoustic confusions in auditory pres- 
entations under noisy conditions. He 
interprets the data as showing sys- 
tematic errors even after the likelihood 
had been removed that such errors 
were perceptual. He notes that decay 
theory demands the existence of par- 
tially decayed memory traces. Such 
traces should yield incorrect memories 
which are systematically rather than 
randomly related to the stimuli which 
were present. His findings are there- 
fore considered by Conrad to be evi- 
dence for decay. 

Further evidence of forgetting under 
circumstances that are difficult for an 
interference theory has been provided 
by Keppel and Underwood (1962). In 
their first experiment a brief auditory 
presentation of a trigram was followed 
by counting activity during the reten- 
tion interval. Recall was down to 
about 50% after 18 seconds on the 
first test of the session. Keppel and 
Underwood suggest that there could 
have been proactive inhibition from ex- 
periences prior to the experimental ses- 
sion. However, in view of a finding 
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by Loess (1964) that a previous ses- 


rate. Of course, they agree in that a 
sion 3 weeks earlier with similar ma- 


slow rate is not found to be better than 


: A recall in the later 
— m in terms of pro- 
ien "inhibition does not seem con- 
vincing. The forgetting would seem 
to occur during the counting activity, 
and since this should not be interfering 
on the basis of similarity, the finding 
is evidence for decay. 

Studies of rate of presentation of se- 
quences of digits for immediate recall 
have been considered evidence bearing 

on decay theory (Posner, 1963). A 

fast rate should result in less time 

elapsing from presentation to test of a 

given element of a sequence, and so the 

finding of better recall with a fast as 
opposed to a slow rate should be evi- 
dence for decay. However, other 
factors complicate the situation, notably 
the possibility for rehearsal which is 
greater with slow rates. Some studies 
have introduced another factor by per- 
mitting subjects to determine the order 
in which they produce written recall, as 
has been noted by Posner (1964). To 
the extent that recall has been found 
to be better with a fast rate of presenta- 
tion and recall, in spite of rehearsal 
working in the opposite direction, the 
findings support a decay interpretation 

(Conrad & Hille, 1958). 

In the case of paired associates Mur- 
dock (1963) has compared conditions 

in which pairs were presented for 1, 2, 

or 3 seconds, while time for recall of 

the cued response was held constant. 

No appreciable difference in the early 

part of the retention curve was found. 
On the other hand, Peterson (1963) 
found that a 2-second rate of presenta- 
tion and test produced significantly 
better recall than a 4-second rate when 
one intervening event separated presen- 
tation from test. Thus, the early por- 
tion of retention curves from paired- 
associate experiments provides equivo- 
cal results in regard to the effect of 


a fast rate in the Steepest part of the 
curve. 

In summary, there is some evidence 
for the presence of decay, so that even 
though interference cannot be excluded 
aS a source of short-term forgetting, 
there remains the possibility that de- 
terioration with time is also involved. 
Whatever responses are available to the 
subject may compete at recall, if there 
is no basis for discrimination. "These 
competing responses must be those 
Which have not been forgotten since 
their presentation. The possibility 
exists that reduced availability of com- 
peting responses may occur in time 
whatever the intervening activity may 
be. An interpretation of this kind has 
been offered in connection with recall 
of individual trigrams (Peterson & 
Gentile, 1965). Retention was measured 
after varying periods of counting. Re- 
tention was found to be better the 
longer the interval from the end of one 
test to the beginning of the next. A 
Teasonable explanation is that competi- 
tion from the Previous items decreased 
the longer the rest interval, There was 
evidence of fewer intrusions of a single 
letter from the Previous trigram the 
longer the rest interval, However, this 
forgetting of the competing element did 
not itself seem to be due to interference, 
There is a suggestion that competition 
and deterioration with time may have 
cooperated in producing the effect. 

The present conception of interfer- 
ence can be distinguished from a dis- 
placement interpretation of interfer- 
ence. It has been Suggested (Bower, 
1964; Waugh & Norman, 1965) that 
items presented earlier are displaced 
by later items, Such an interpretation 
may be adequate for curve fitting in 
various situations, However, evidence 
as been Presented that time relation- 
ships are of Some importance. It 
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` would seem further that the finding of 
. spontaneous recovery would be diffi- 


cult to account for in a displacement 
theory. 

Duration of presentations appears to 
be of greatest importance in the later 
portion of short-term retention curves, 
where there is a positive relationship 
between duration of presentations and 
recall. Murdock (1963) has found 
that when from three to five pairs in- 
tervened between presentation and re- 
call, recall was directly related to dura- 
tion of presentation. Any effect of 
duration of the retention interval would 
Seem to work against the direction of 
this finding, since the intervening pairs 
received longer presentations if the 
pair to be tested received such, In 
Peterson’s (1963) rate experiment pre- 
viously mentioned there was better re- 
call for the slower rate when from two 
to Six events intervened between pres- 
entation and recall. However, this su- 
periority cannot unambiguously be at- 
tributed to the duration of presentation, 
Since the latter was confounded with 
time permitted for recall. 

The effect of longer durations of 
presentations in producing better per- 
formance at the asymptote of the short- 
term retention curve is in accord with 
studies of list learning in which longer 
Presentation durations have also pro- 
duced better performance (Bugelski, 
ie. Nodine, 1963). The failure to 
is an advantage for longer presenta- 

n durations early in the short-term 
retention curve, when the most rapid 
forgetting occurs, suggests that some- 
thing other than learning may be mask- 
ing learning during this period. A re- 
cency mechanism is such a possibility. 


REPETITIONS 


Repetitions of presentations involve 
amounts of time devoted to the event 
to be remembered that are multiples of 
a single presentation. If increasing the 
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Fic. 3. Recall after one or two 
presentations of a pair. 


duration of presentation improves re- 
call, it would be expected that multiply- 
ing the number of presentations should 
also improve recall. This has been 
found to be the case, as is shown in 
Figure 3. The data plotted there are 
from a study of pairs of words and 
digits presented and tested at a 2-sec- 
ond duration per exposure (Peterson, 
1963). Two pairings separated by an 
8-second spacing interval produced bet- 
ter recall than one pairing after an 8- 
second retention interval Intervals 
were filled with presentations and tests 
of other pairs. The failure of repeti- 
tion to improve retention with a single 
intervening event and the improve- 
ment at the longer retention interval 
agree with findings by  Murdock 
(1963). He tested lists of six pairs 
presented in the same serial order from 
one to three times. A strong effect of 
repetitions was found at the asymptotes 
of the retention curves, while there 
were only small differences in the ini- 
tial slopes of the curves. —— ^. 
When the effect of repetitions 1s con- 
sidered, another variable must be taken 
into account. The time interval be- 
tween repetitions has been found to be 
important in several studies (Greeno, 
1964; Peterson, Hillner, & Saltzman, 
1962; Peterson, Wampler, Kirkpatrick, 
& Saltzman, 1963). In Figure 4 data 
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Fic. 4. Recall after a 16-second retention 


interval as a function of the spacing between 
two presentations. 


have been plotted from a study vary- 
ing the spacing interval, while reten- 
tion was tested 16 seconds after the 
second presentation (Peterson, Wamp- 
ler, Kirkpatrick, & Saltzman, 1963). 
Presentation and test of other pairs 
filled the intervals. Retention was 
found to improve as the spacing in- 
terval lengthened from 0-16 seconds. 
The decrease from 16-32 seconds was 
not significant, although it suggested 
that the beneficial effect of spacing has 
a limit. 

The positive relationship between 
spacing and recall found at longer re- 
tention intervals contrasts with a slightly 
negative relationship at very short re- 
tention intervals (Peterson, Hillner, & 
Saltzman, 1962). Again, this finding 
of a differential effect of the same vari- 
able in different portions of the reten- 
tion curve suggests that there is more 
than one mechanism in short-term re- 
tention. 

The increasing spacing function at 
longer retention intervals contrasts 
with the decreasing retention function 
found over intervals of the same dura- 
tion following a single presentation. A 

paradox appears in that the best way 
to learn a pair seems to be to allow the 
earlier pairing to be partially forgotten. 
One way to resolve the paradox is to 
assume that there are two components 
involved in storage over these inter- 
vals, and that with the passage of time 
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the one diminishes while the other in- 
creases. An activity trace may dimin- 
ish over the interval, while a structural 
trace is consolidated. At time of test- 
ing after two pairings, a longer time 
during which consolidation can occur 
has elapsed since the first pairing in 
the case of spaced pairings than in the 
case of massed pairings. The variable 
of duration is thus expanded to include 
time between pairings as well as time 
during pairings. 

It was noted previously that experi- 
ments by Hebb (1961) and Melton 
(1963) indicate that a structural trace 
can be produced in a single trial, when 
interfering activity prevents any on- 
going fixation process. However, this 
does not rule out the possibility that 
permitting a consolidation Process to 
continue uninterrupted would result in 
a stronger structural trace. It would 
be assumed that the neural activity dur- 
ing a presentation persists in attenuated 
form when the presentation has ended, 
and in the present experiments that this 
activity can continue at the same time 
that the subject is engaged in other 
verbal activity not involving the same 
neural systems. Tt could not be ex- 
pected that spacing would improve 
learning when activity during the spac- 
ing interval consisted of new arrange- 
ments of the same verbal elements used 
in the learning task. Such an arrange- 
ment was designed to produce maxi- 
mum interference, and, indeed, Melton 
(1963) found the more other immediate 
memory tests that were given between 
repetitions of a given Sequence of digits, 
the slower the learning wa Digi 
comprised the interpo] ted i ar 
as the learning ta T task as well 
ao - In Studies where 
improvement has been noted with spac- 
ing of Tepetitions the interpolated ac- 
tivity has Involved elements differing 
Tom those of the learning task. In 
Such a case the consolidation would not 
€ pitted against massive interference. 
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A fixation process following learn- 
ing has been inferred from a number of 
animal studies in which traumatic in- 
terruption of the hypothesized process 
has been investigated, and from drug 
studies in which facilitation or inhibi- 
tion of the process has been reported 
(Gerard, 1963). In addition, Walker 
and his co-workers have incorporated 
the measurement of galvanic skin re- 
Sponses into studies of verbal reten- 
tion with results interpreted as evi- 
dence for consolidation (Walker & 
Tarte, 1963). These studies have in- 
volved intervals of much longer dura- 
tion than the intervals used to space 
presentations in the present studies. 
This does not preclude the possibility 
that the same process may be involved, 
but it suggests caution. 

There has been a report of an elec- 
troconvulsive shock study with a sub- 
Stantial number of humans over inter- 
Vals not differing greatly from those 
of the spacing studies previously de- 
scribed. Cronholm and Lagergren 
(1959) reported testing retention of a 
single presentation of a number when 
intervals of 5, 15, or 60 seconds elapsed 
from the end of presentation to the on- 
set of shock, Later recall of the num- 
ber was found to be better the longer 
the interval from presentation to shock. 
Nothing was said about the patient's 
verbal activity during this interval, and 
presumably he was free to rehearse. It 
would be of interest to have the results 
Of a study in which controls for re- 
hearsal were present, since the present 
application of the concept of consolida- 
tion requires that it occur at the same 
time that the subject is engaged in 
other verbal activity, 

There are alternative explanations 
of the effect of spacing presentations. 
Greeno (1964) has suggested that 
spacing aids discrimination of a paired- 
associate item from other items by in- 
troducing these other items between 


repetitions. When presentations are 
massed, a second presentation occurs 
when generalization tendencies to pro- 
duce errors are at a minimum, and so 
reduction of the generalization tendency 
will be small. While such an explana- 
tion might be relevant when cues of 
high similarity are used, it does not 
seem convincing as an explanation of 
studies in which words were used of 
low similarity with which the subject 
was very familiar. 

Another possible alternative is the 
extinction-recovery theory which has 
been advanced to explain distributed 
practice effects in the learning and re- 
tention of lists (Keppel, 1964). It is 
held that associations (contextual and 
otherwise) are extinguished during 
presentations of later pairs, but that 
they recover with time after extinction 
to interfere with recall. Permanent ex- 
tinction of competing associations is 
facilitated by distributed practice. 
Whatever the success of the theory in 
other contexts, it would not seem rele- 
vant for the present spacing experi- 
ments. Here the spacing and retention 
intervals were filled with the same kind 
of pairs. If these intervening pairs are 
assumed to produce extinction, then 
they cannot at the same time be as- 
sumed to permit recovery. 


Tur Recency MECHANISM 


Further evidence for the notion of a 
short-term store as distinct from the 
learning mechanism is provided by the 
negative action of a recency mechanism 
during anticipation (Peterson, Brewer, 
& Bertucco, 1963). It was found in a 
paired-associate learning task that sub- 
jects in proceeding through a given 
trial tended to avoid using responses 
which had already been presented on 
that trial, Presumably the subject re- 


frained from using as a guess a re- 
hich he knew must be wrong 


sponse Wi e 
f its recent presentation. The 


because o: 


Proportion correct 


| | | i n i 
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Retention interval (sec) 


Fic. 5. 
The curve is based on the equation S(t) 


ai -op - de] ) 


basis for the subject's recognition of an 
error would seem to lie in a recency 
mechanism. The action of the short- 
term mechanism during paired-associ- 
| ate learning may be characterized as a 


Running memory span for pairs. 


kind of running memory span. It is 
known that in a situation in which in- 
formation enters the organism se- 
quentialy, the subject can report ac- 
curately, in order, the most recent 
digits or words that have been pre- 
sented (Pollack, Johnson, & Knaff, 
1959; Postman, Turnage, & Silver- 
stein, 1964). It would appear that the 
subject has a similar possibility for re- 
call in the continuous paired-associate 
situation as well. A test shortly after 
presentation should permit accurate 
recall on the basis of the Tunning 
memory span without an association 
having been established. The results of 
an experiment * designed to investigate 
this possibility are summarized in Fig- 
ure 5. Each of 20 students read aloud 
a sequence of pairs consisting of a word 
and a number presented at a 2-second 
rate ina memory drum. No tests were 
included among the sequences of pairs. 


3Lawrence Clayton assisted in this ex- 
periment. 
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The drum stopped after from 8-12 
pairs had been presented, and a sub- 
ject was given 10 seconds to speak all 
of the pairs that he could remember. 
Ten such lists of uncertain length were 
presented and tested with each subject. 
The recalls were scored in terms of the 
number of exposures of other pairs 
separating presentation of an individual 
pair and the beginning of the recall 
period. The ability of the subject to 
correctly recall a pair is seen to be 
very high immediately after presenta- 
tion and to drop off quickly thereafter. 
It seems reasonable to assume that the 
mechanism that the subject uses under 
conditions of free recall can also be 
used when a cued recall is being tested. 
The sequence of previous presentations 
is immediately available to the subject 
for a few seconds in a short-term 
storage. 

Note that the short-term store dis- 
cussed here is a postperceptual store. 
Subjects read items aloud, and the re- 
tention interval is measured from that 
point. There is a distinction to be 
made between this short-term mecha- 
nism and that conceptualized by Sper- 
ling (1963) as the visual information 
storage. In Sperling’s conceptualiza- 
tion the subject’s rehearsal of informa- 
tion places it in an auditory infor- 
mation storage. The rapid decay of 


Proportion correct 


" yew 
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Retention intervat (see) 
ee components of recall, R(t). 
es the short-term component, 


s the learning component, and 
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information that is found in his experi- 
ments using tachistoscopic presentation 
is assigned by him to the visual infor- 
mation storage. 


INTERACTION OF COMPONENTS 


Three components have been dis- 
cussed in connection with the short- 
term recall of paired associates. A 
model of their interaction follows. The 
empirical probability of a correct recall, 
R(t), is considered to be the union of 
the probabilities associated with the 
three components. 


R(t) = S(t) + L(t) [1 — S(4)] 
+G[L-—S(t)] [1 - L(0)], 


where S(t) is the probability of a cor- 
rect recall on the basis of the short- 
term mechanism, L(t) is the proba- 
kai of recall by action of the learning 
€chanism, and G is the probability of 
fs E correctly. The equation gives 
p Fus] that a correct recall will 
cia rough the operation of at least 
p of the three components. 

an of the model to some 
pe een made in Figure 6 to il- 
: strate the interaction of the compo- 
Ls " rs consisting of a word and 
pe er irom 1-12 were presented in 
a mory drum and tested after an 
interval filled with presentations and 
tests of other pairs. A 2-second rate 
m: used. Primacy effects were con- 
rolled by the use of dummies in initial 
oe of sequences. Position in the 
“te Bi was balanced out over a num- 
nt €quences. The empirical points 
ne means of 12 subjects, each 
tested 32 times at each retention in- 
terval. The curve R(1) has been fitted 
to the data points by a trial-and-error 
process. The contribution of the short- 
term mechanism, S(t), is represented 
by the same curve used with free re- 
call of pairs in Figure 5. The con- 
tribution of the learning mechanism, 


* Richard Wampler assisted in this experi- 
ment, 
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L(t), has been made similar in form 
to a curve which would fit the data 
points for different lengths of spacing 
intervals in Figure 4. The guessing 
probability has been taken to be the 
reciprocal of the number of response 
alternatives. 

The equations for S(t) and L(t) 
were chosen so that their combination 
with G would yield a curve giving a 
reasonable fit to the data points. The 
curves for the three components are, of 
course, hypothetical and not determined 
by the data of this experiment. Their 
forms have been inferred from the 
other experiments shown in Figures 4 
and 5. 

As Figure 6 illustrates, an increasing 
contribution of the learning mechanism 
can be hidden by a rapidly dropping 
short-term component. The learning 
component was presumed to have in- 
creased to some positive value by the 
end of the presentation, and thus it is 
greater than zero at the beginning of 
the retention interval. The point of 
view underlying the conceptualization 
is that recall is a search process which 
requires time. The strength or avail- 
ability of a response due to the in- 
fluence of one of the mechanisms is re- 
flected by a latency distribution. The 
parameters of the distribution vary 
with time. Those of the short-term 
component result ina distribution whose 
mean becomes greater with time. In 
the case of the learning component the 
mean of the distribution decreases with 
time. The length of the recall period, 


2 seconds in the present experiment, 


sets a limit on the maximum latency 
A condition 


which can be successful. 

whose latency distribution for the 
short-term component lies entirely 
within the recall period will have 100% 


correct recall. A retention condition 
whose range of latencies extends be- 
yond the recall period will show for- 
getting. It follows that less would be 
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forgotten if a longer recall period were 
provided, and hence it is not appropri- 
ate to speak of a pairing having been 
forgotten, or lost once and for all from 
storage. By this reasoning there is 
avoided the implausibility of a con- 
solidation process resulting in some- 
thing which had been lost later being 
produced out of nothing. Rather, it is 
possible to conceive of a trace which 
was present from the beginning being 
strengthened or becoming more readily 
available in the sense of requiring less 
time to be found. 

It is not claimed that with unlimited 
recall time the subject would eventually 
hit upon the correct response. The re- 
call period may itself be a source of 
interference. Further, the possibility 
of the subject stopping the search with 
an incorrect recall may increase with 
time. It is considered that with a short 
recall period this problem may for 

) present purposes be ignored. 


SUMMARY 


The position taken in this paper ac- 
knowledges that short- and long-term 
retention have much in common. In- 
terference produces forgetting in both 
cases, and repetition generally improves 
retention. However, it is considered 
that in the case of paired associates for 
a brief period after a presentation a 
recency mechanism interacts with the 
learning mechanism. Several variables 
affect the retention curve 
during the early declining 
contrasted with the asymptote. Among 
these are duration of presentations, 
number of repetitions, and duration of 

the spacing interval between repeti- 
tions. The short-term or recency 
mechanism is conceptualized as a post- 
perceptual mechanism whose effective- 
ness decreases through time and the ac- 
tion of other information. Recall for 
very recent events is characterized by 
rapid availability, by which is meant 


differently 
phases as 
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search times which can be effective in 
short recall periods. The long-term or 
learning mechanism is conceived of as a 
relatively stable store. There are in- 
dications that an increase in the avail- 
ability of information on the basis of 
this component occurs for a period of 
time after presentation. In empirical 
retention curves the increasing com- 
ponent may be hidden by the rapidly 
decreasing short-term component. The 
presence of the former has been inferred 
from the beneficial effect of increasing 
the time between repetitions of a pair- 
ing and from the reminiscence found 
after irrelevant intervening activity. 
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A stochastic theory is presented to (a) account for a speaker's 
selection of a linguistic response to distinguish a referent stimulus from 


nonreferent stimuli, and (b) predict the 
using the speaker’s response, correctly identifies the referent. 


probability that a listener, 
The 


speaker’s response is postulated to result from the concatenation of 2 


hypothetical psychological 


Stages, termed “sampling” and “com- 


parison.” The listener’s identification of the referent is postulated to 
result from a 1-stage process similar to that of the speaker’s compari- 
son stage, Results from several interrelated experiments are reported 


which provide support for the basic assumptions 


of the theory, 


Psychological processes inherent in classical word association and in 
recognition and recall are interpreted in terms of the speaker and 


listener theory. 


Philosophers, linguists, and psychol- 
ogists generally find it necessary to in- 
troduce the concept of the referent, or 
closely related concepts, in their dis- 
cussions of the semantic properties of 
language (e.g. Brown, 1958; Kantor, 
1936; Morris, 1946; Ogden & Rich- 
ards, 1923; Quine, 1960; Russell, 1921 ; 
Skinner, 1957). The concept of the 
referent includes the objects, properties, 
events, or relationships about which a 
person talks, writes, or gestures. Ref- 
erents may be particular stimuli or 
classes of stimuli. Various writers in- 
clude in the class of referents not only 
stimuli that are present in the im- 
mediate environment of a speaker or 
listener, but also objects and events 
that are removed in time or place, and 
even totally fictional constructions. In 
the present paper we restrict our in- 
quiry to referents that are identifiable 


1 Supported in part by NSF Grant GS-179 
to Bertram D. Cohen. We wish to acknowl- 
edge the valuable suggestions of Saul Stern- 
berg who read an earlier draft of this paper. 
We also wish to thank Ernst Rothkopf for 
suggesting a number of the methodological 
refinements we used in collecting word- 
association data. 


stimuli in the immediate environment 
of a speaker and a listener. With 
this restriction the psychological analy- 
sis of relations among the speaker's 
referent, the speaker's linguistic act, 
and the response evoked in the listener 
by the linguistic act becomes an empiri- 
cally tractable problem. 

The proposition, held by philosophers 
for centuries, that words do or should 
stand in a fixed relationship to their 
referents, is almost useless in the 
psychological analysis of human com- 
munication. Most referents have many 
names, and many referents can be 
called by the same name. A speaker’s 
specific utterance is conditioned not 
only by his immediate social, physical, 
and linguistic context, but also by his 
linguistic development, previous inter- 
actions with the listener, perceptions of 
the listener, and by the particular re- 
Sponse he intends to evoke from his 
listener. It is our contention that even 
when the effects of these variables are 
identified or controlled, there remains 
an Important degree of unpredictability 
in the speaker’s choice of a specific 
utterance; that is, the utterance of the 
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speaker is a probabilistic choice from 
among a set of responses in the speak- 
er’s repertoire, each of which is more- 
or-less adequate as a linguistic cue for 
his referent. The process by which a 
particular utterance is selected and 
emitted is termed the referential process 
of the speaker. 

The interpretation of the speaker’s 
utterance by a listener at any given 
time—or, more specifically, his identifi- 
cation of the speaker’s referent—is also 
assumed to result from a probabilistic 
process. We term this the referential 
process of the listener. This is not to 
say that the listener’s immediate envi- 
ronment, linguistic development, previ- 
ous interactions with the speaker, per- 
ceptions of the speaker, and goals do 
not also play an important role in 
determining his reaction to the specific 
utterance of the speaker.’ 

One of the most serious handicaps in 
the investigation of referential processes 
a Ls sgn human communication is 
s js the Speakers referent, even when 
eh me immediate environment, is 
Deis Lo entifiable only by inference 
us e Pen response. This 
a o identifying the referent 
whict any analysis of the process 

hich links the referent to the speak- 
er s response a circular one. Further- 
ot the identification of the referent 
ME e Bev response may be un- 
iie our conjecture about the 

c nature of the process is 
correct, 
uum n in identifying the 
horough crie de pads s 
eh de copies accusacy of the 
a i cision process in referent 
identification. Added to this difficulty 


2“Speaker” will be used in the generi 
Sense in this paper to refer to a DERIO 
Sat itinicative act is oral, written, or gestu- 

- “Listener” will be used in a similar 
Eeneric sense. 


is the fact that overt discriminative 
listener reactions to the speaker’s ref- 
erent often do not (or cannot) occur 
at all in conversation or in reading. It 
is not surprising, therefore, that the 
investigation of referential processes 
by psycholinguists has been primarily 
anecdotal—a few examples of referents 
and linguistic responses typically bear 
the burden of relating theory to lin- 
guistic events. 

The purposes of the present paper 
are to (a) describe an experimental 
paradigm in which the speaker’s ref- 
erent is identifiable independently of 
the speaker’s behavior, (b) describe 
the main features of a stochastic theory 
of referential behavior, and (c) sum- 
marize a series of experiments designed 
to test basic propositions of the theory. 
A number of implications of the theory 
for certain well-establised verbal phe- 
nomena will also be sketched. 


EXPERIMENTAL PARADIGM 


The basic features of the experimental 
paradigm are as follows. A referent stimu- 
lus is assigned to the speaker by the experi- 
menter. The speaker’s task is to provide a 
linguistic response that makes it possible for 
his listener to identify the referent stimulus. 
Given the speaker’s response, the listener’s 
task is to identify the referent. 

An important additional feature of the 
situation is the fact that the nonreferents 
from which the referent is to be distinguished 
are made explicit to the speaker. Also the 
listener is given an explicit set of stimuli 
from which to choose the speaker’s referent. 
The use of well-defined stimulus sets for both 
speaker and listener is a convenient simplifi- 
cation of the task and not a necessary 
feature of the experimental investigation of 


referential behavior. : . 
In the studies described in this paper, 
the experimental paradigm was realized An 
the following manner. The speaker ant 
listener were both shown pairs of words, one 
word pair at a time. The ME 
designated one member of each word pair 
erent and the other as 


eaker's refi 1 er c 
2 utem: Note that in these initial 


studies the referents and nonreferents were 
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themselves words. The speaker was a 
ted to provide a one-word clue (the 
US ) that would enable the 
speaker response d cec 
listener to identify the referent wori "d 
rectly. The speaker was told to use a “clue 
word that has some understandable connec- 
tion with” the referent and not to use any 
“clue word that sounds like or is spelled like 
either of the stimulus words. The listener 
was instructed to identify the referent word 
from the two alternative possibilities using 
the speaker response. 3 

Consider, for example, the synonym pair 
WOMAN-LADY with LADY as the referent. 
Examples of permissible and effective (for 
listener’s referent identification) speaker re- 
sponses include tramp, title, Chatterly, 
finger, and pink. A permissible but proba- 
bly ineffective speaker response is female. 
Most of the word pairs used in the present 
series of studies were synonym pairs. The 
choice of synonym pairs was motivated by 
the thought that the responses associated 
with the referent are likely to overlap con- 
siderably with those of the nonreferent and 
that the effects of the nonreferent on speaker 
behavior are most salient under conditions 
of high overlap. A number of unrelated 
word pairs, for example, BUY-ATTACK, were 
also included. The method is, of course, 
amenable to almost any type of word pair, 
for example, supraordinates (ANIMAL-DOG), 
coordinates (noc-car), pairs related in ex- 
perience (TABLE-CHAIR), etc, 

The experimental method is also readily 
adapted to nonverbal stimuli, more than one 
nonreferent stimulus, and complex speaker 
responses (phrases, sentences, etos A 
number of investigators (Carroll, see Os- 
good & Sebeok, 1954, p. 200; Krauss & 
Weinheimer, 1964; Maclay, 1962; Maclay 
& Newman, 1960; Werner & Kaplan, 1963) 
have devised and used experimental situa- 
tions for studying the communication of non- 
verbal stimulus objects (blocks and pegs 


3 Writers concerned with referential be- 
havior often distinguish between words 
(signs) and "things" (referents). From a 
psychological standpoint, it is important to 
maintain a distinction between speaker re- 
sponses and referents. Any stimulus, how- 
ever, can function as a referent. There is 
no reason to restrict the term referent to 
nonverbal stimuli. Thus, spoken or printed 
words, as well as nonverbal stimuli, can and 
often do function as referents. The advan- 
tage of using verbal rather than nonverbal 
referents in initial studies will be made 
apparent later in the paper. 


varying in shape, size, and color; simple 
geometric forms; novel designs; olfactory 
stimuli). The speaker's task is essentially 
to describe one of the objects so that the 
listener can select it from among a set of 
alternatives. Lantz and Stefflre (1964) re- 
cently proposed that the accuracy with which 
a color is communicated to listeners be used 
as a measure of the codability of the color. 
This measure was found to be the best of 
several measures of stimulus codability in 
terms of its ability to predict the ease with 
which the colors could be remembered in 
recognition tasks. To our knowledge, the 
model we propose is the first systematization 
of the psychological processes governing 
speaker and listener behavior in this type of 
communication task. 


THEORY 


Consider a population of subjects in 
which differences among them in their 
subcultural backgrounds have been 
minimized, That is, all speakers and 
listeners have been drawn from a pop- 
ulation of persons who are similar in 
age, education, and language abilities. 
It is also convenient, at this point, to 
restrict our attention to speaker-listener 
pairs who, though similar to one an- 
other, are strangers. This restriction 


is imposed in order to minimize the 
special effects on communicative be- 
havior that various types of inter- 
personal relationships might produce. 
How shall we account for the vari- 
ability among these speakers in the 
responses they emit to communicate a 
referent? Why do some, but not all, 
of the listeners correctly identify the 
referent given a particular speaker 
response? 

One approach is to account for the 
behavioral variability of speakers and 
listeners in terms of the similarity in 
meaning of the referent and nonreferent 
words—or, in the case of nonverbal 
stimuli, by similarities in their names 
or in their physical properties. "Thus, 
one might expect that both the number 
of different Speaker responses given by 
a group of subjects and the probability 
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of a listener error are positively related 
to the similarity in meaning of the 
referent and nonreferent. Measures of 
meaning similarity can be derived from 
semantic-differential measures or from 
the overlap of word associates (or 
names) given to the referent and to the 
nonreferent stimulus word (Marshall 
& Cofer, 1963). How can these meas- 
ures be used to predict details of 
Speaker and listener behavior, for ex- 
ample, the specific set of speaker re- 
sponses and their probabilities, and the 
probability of a listener error for any 
particular speaker response? A con- 
ceivable approach (when the stimulus 
array consists of words) is to obtain 
an overlap measure between each of the 
stimulus words and each of the pos- 
Sible speaker responses. Speaker re- 
ies could then be ordered in terms 
ad a apes relative overlap with referent 
5 ronne erent ; from this ordering 
ae ae able to estimate the dis- 
S n of speaker responses and, for 
pene speaker response, predict the 
AR m of a listener error. Un- 
hee ^ ely, this approach is unwieldy 
Do emis the collection of word- 
ee a data for each of the speaker 
want es v" for each of the stimulus 
Pp ra € communication task. It 
Bre Chg circular in that it re- 
rol Bua identify the speaker 
probabilities, ore we can estimate their 
m Eon approach is considered 
miguesied Bae rom preliminary work 
ilo at standard word-associa- 
rms for the referent and non- 
referent stimuli alone are suffici f 
redict detail ein 
P J Hi of speaker and listener 
behavior using the stochastic the 
outlined below. The essential n. waded 
of the theory are as follows. aati 
It is postulated that the speak- 
€r's referential process is a concatena- 
tion of two hypothetical psychological 
Stages, termed sampling and compari- 


son. Both stages are assumed to be 
probabilistic. The speaker’s referential 
process begins with a sampling stage 
in which a response from the speaker’s 
repertoire is selected randomly from 
the distribution of word associations 
to the referent stimulus alone. The 
distribution of associations to the ref- 
erent is denoted J,. In the comparison 
stage, which always follows sampling, 
the relative “associative strengths” of 
each of the stimulus words to the 
sampled response determines the con- 
ditional probability that the speaker 
will utter the sampled response. The 
associative strengths of the referent 
and nonreferent to a sample response, 
i, are denoted s;(r) and si(m), te- 
spectively. If, as a result of the com- 
parison stage, the sampled response is 
rejected as an inadequate communica- 
tion, the sampling-comparison cycle is 
repeated. Ultimately the speaker emits 
a response, thus terminating the proc- 
ess. A diagram of this hypothetical 
process is given in Figure 1. 

It is assumed that the listener's 
choice is the result of a probabilistic 
process which is a function of the 
associative strengths of the speaker 
response to each stimulus word. For 
the listener the associative strengths of 
the referent and nonreferent to a 
speaker response i are denoted L(r) 
and (n), respectively. The probability 
that the listener will correctly identify 
the referent is determined by these two 
associative strengths (see Figure 1). 

The referential process of the listener 
is assumed to be similar to the com- 
parison stage of the speaker. Put an- 
other way, the speaker essentially takes 
the role of the listener in the ernie 
comparison stage—a notion reminiscen 
of Mead's ( 1954, p. 147) use of the 
“generalized other” in discussing lan- 


age. : 
The formal theory is not necessarily 


limited to verbal stimuli. The choice 
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sir), si(u) 


response 7 


REJECT sampled 
response 2 
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WORD PAIR r,n;. 
SPEAKER RESPONSE ; 


— COMPARISON: 


a of r as referent, 


lic), li:(n) 


Fic. 1. 


of verbal, rather than nonverbal, stimuli 
is particularly advantageous for these 
initial studies if word-association data 
provide satisfactory estimates of the 
relevant quantities used in the theory. 
Nor is the theory necessarily limited 
to single-word speaker responses. 
Phrases, sentences, etc., may be per- 
mitted as speaker responses. However, 
the operations analogous to word-as- 
sociation techniques for nonverbal 
stimuli and for multiple-word re- 
sponses are not yet well developed. 


Speakers Sampling Stage 


A response belongs to J, if there is 
a nonzero probability that it will occur 
as a word associate to r. The prob- 
ability P,(i) of sampling response i in 
I, is not uniform for all responses in 
I, as it might be in sampling from a 
section of a thesaurus. Since the 
speaker must sample from J, more 
than once if a sampled response is 
rejected in the comparison stage, the 
question arises concerning the prob- 
ability of again sampling the rejected 
response. A plausible and mathe- 
matically tractable assumption is that 
sampling is done with replacement so 


CHOICE of n as referent 


Sketch of the speaker and listener processes. 


that Pzí,() = Pzt1(i) = P,(i) for all 
t and i. The additional subscript t 
indicates the ith sampling which has 
followed t — 1 rejections of i. Our 
subjective experience with the speak- 
er's task suggests that in searching for 
an adequate response he may consider 
the same response more than once 
and may, in fact, emit it after having 
rejected it one or more times. 


Speaker’s Comparison Stage 


Each response i sampled by the 
speaker defines, for a given referent- 
nonreferent pair, a pair of scale values, 
si(r) and s(n), respectively. When 
there is a high degree of overlap be- 
tween the associative responses to the 
referent and nonreferent, as is the case 
with synonyms, the difference between 
Si(r) and s(n) will be relatively small 

4 Another 


pling is don 
> Phi) 


at time ¢, 


plausible scheme is that sam- 
e with replacement, but P,,+(i) 
if i was sampled and rejected 

A third alternative is that 
Py, t+1 (i) =0 if i was sampled and rejected 
at time t. These two sampling schemes re- 
quire an additional rule for readjusting the 
Probabilities of sampling the other responses 
in [. In the third sampling scheme, a 
speaker may exhaust 7, unless I, is infinite. 


= 
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for many of the sampled responses. 
The closer the values of s;(r) and 
si(n), the more likely it is that the 
speaker will make an incorrect decision 
in the comparison stage. For example, 
a speaker might emit a sampled 
response i although the associative 
strength of stimulus r to response i is 
less than that of 7 to i. 

As we have already noted, a speaker 
might reject response i at one time 
and, having sampled i again at a later 
time, might then emit it. It is ex- 
plicitly assumed that the conditional 
probability of emitting a sampled re- 
sponse is unchanged by previous in- 
stances of sampling and rejection. 

The identification of the comparison 
stage as a decision process suggests a 
number of models that have been de- 
veloped for choice behavior and com- 
parative judgment. A prominent model, 
applicable to the speaker’s comparison 
Stage, is Thurstone’s model of com- 
Parative judgment. According to this 
model, speaker comparison consists of 
Fleeting a value from each of two 
; aper distributions of associa- 
he strength, say Yr and Yn; if Yr > y» 
: Speaker emits the sampled re- 
Ponse; otherwise he rejects the re- 
Sponse and samples again from 7,. A 
peris model, the one we propose to 
= “A y on speaker’s comparison stage, 
Terr 9 agro by Bradley and 
TE ?- In spite of the formal 
snos (G etween the Thurstone 
qe ae V) and the Bradley- 
: 1odel, they are almost equivalent 
in practice (Torgerson, 1958 p. 20 

l'orgerson, 1958, p. 202). 

The essential notions of the Bradley- 
Terry model are simple and mue ht 
forward and require fewer ampo : 
in practice than the Thurstone cry 
Another desirable feature of the 
Bradley-Terry model is its link to 
Luce's (1959) theory of choice be- 
havior, The usefulness of this link will 
€come apparent when we extend our 
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analysis of referential behavior to 
multiple nonreferents. 

The application of the Bradley-Terry 
model requires that s;(7) and s;(7) be 
restricted to values equal to or greater 
than zero for all i, r, and n. Let Pe(iļi) 
be the conditional probability of emit- 
ting response 1, given that i is sampled. 
Then, for P;(i|i) 40,1, 


P.(ili) = si)/Essr) + si]. OJ 


What remains to be done is to specify 
precisely how the associative strengths, 
si(r) and s;(7), are obtained. A rela- 
tionship is postulated between associa- 
tive strength and associative probability 
obtained from word-association data. 
This relationship is given below in the 
section on Estimation of Theoretical 
Quantities. 


Asymptotic Distribution of Speaker 
Responses 


If the probabilities P,(i) and P, (ili) 


„remain unchanged with repetitions of 


the sampling and comparison stages 
(as assumed in the previous sections), 
then the entire speaker process is a 
stationary Markov chain. There is an 
observable absorbing state correspond- 
ing to the emission of each response 1n 
I, and a single unobservable transient 
state, T, corresponding to the rejection 
of any response in the comparison stage. 

Thus, the probability that a speaker 
emits any given response as à result 
of a single sampling-comparison cycle 
is the product of two probabilities : the 
probability that the response 1$ sam- 
pled, and the conditional probability 


that it is emitted given that it is sam- 


pled. That is, the transition prob- 


ability : 

P,{Ei,u-1 (emission of response ? — 
at time ¢ + 1)! Ti} = P.) P«G]|2). 

er has not emitted a 


nd eak d ead: 
Now, if the sp! Cr pility of rejecting 


response, the pro 
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any response in I, following a single 
sampling-comparison cycle is: 


PATT} = 1- X POPS. 


The initial probability vector for the 
states T, Ex, Es, ... is [1,0,0,.. -]. i 

If the speaker is given an unlimited 
time to emit a response, then it can be 
shown (Kemeny & Snell, 1960, pp. 
43-68) that the asymptotic probability 
of emitting response 1, 


pi P.G)P.GH)/ x P.G)P4GU- [2] 


Listener's Comparison Stage 


The application of the Bradley-Terry 
model to the listener's identification of 
the referent is as follows. Let x and y 
denote the two stimulus words, and let 
P,(x,y) denote the probability that a 
listener identifies x rather than y as the 
referent given speaker response i, Then 
if P;{x,y} 240,1, 


Pifxy} = hG)/DG) + Lo). [3] 


Equation 3 has the same form as Equa- 
tion 1. The associative strengths U(x) 
and (y) are always restricted to 
values equal to or greater than zero. 
We have, however, deliberately in- 
troduced different notation (J and s) 
to refer to the associative strengths 
in the listener’s and in the speak- 
er’s comparison stages. Although 
the speaker’s comparison stage is con- 
ceptualized as one in which he es- 
sentially takes the role of the listener, 
the associative strength between any 
given stimulus x and response i in the 
speaker's comparison stage may not be 
equal to that involved in the listener's 
comparison stage. It would, however, 
be expected that (x) and si(x) are 
monotonically related. Estimation of 
L(x) and (y) from associative prob- 
ability is given in the next section. Tt 
may also be noted that P;{x,y} in 
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Equation 3 can be estimated directly 
from listeners' data (providing they are 
homogeneous), whereas Pe(iļi) in 
Equation 1 is a theoretical quantity 
which cannot be directly estimated 
from speakers' response data. 


Multiple Nonreferents 


A generalization of the theory to 
communication involving more than 
one nonreferent will now be sketched. 
Consider, first, the referential process 
of the speaker. We retain the assump- 
tion that the effects of the nonrefer- 
ents on the choice of a speaker re- 
sponse take place in the comparison 
stage only. Thus the sampling stage 
remains unchanged for multiple non- 
referents. A generalized version of the 
Bradley-Terry model (Luce, 1959, p. 
23), applied to the speaker's compari- 


son stage, is as follows, If Pelli) A 
0,1, then: 


P.(ili) = 80/2: si), [4] 


where X is the set of referent and 
nonreferent stimuli, 

The extension of the listener’s ref- 
erential process to a choice from among 
more than two stimuli takes the same 
form as that for the speaker’s com- 
parison stage. That is, the probability 
that the listener will identify any xeX 
as the referent is given by: 


lia) E L(y). 
vX 


However, since Luce’s (1959) gen- 
eralization of the Bradley-Terry model 
15 a consequence of Luce’s choice 
axiom, it is also of interest to consider 
the possibility that the choice axiom 
itself (sometimes called the constant- 
ratio rule) also holds for listener 
behavior in our experimental paradigm. 
For an empirical test of the choice 
axiom we considered the postulated 
relationship between two-choice and 
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three-choice data. That is, the follow- 
ing form of the choice axiom was used: 
If Pi(x,y) 240,1, then: 


Pi{x,y,3} 
Pi{x,y,2} + Pilyss] ' 


where Pi(x,y,z) and P;(y,v,s) are the 
probabilities of choosing x and y, re- 
Spectively, from the stimulus ‘array 
x,y,z. (The last two stimulus ele- 
ments in brackets can be written in 
te order.) Equation 5 says, in 
ves ae the probability that a lis- 
een jane x as the referent when 
tie, de y are the alternative possibil- 
«d he same as the conditional prob- 
: ia y of choosing x from x and y 
Eur le and z are available; the 
at oa is, of course, the 
fnt te oth cases. It should be noted 

nis test of the model is parameter 


free ; £ 
ree and does not require word-associa- 
tion data. 


Pifxy}= [5] 


Estimation of Theoretical Quantities 


"ees guida discussed in the preced- 
E Ei ections apply to the behavior of an 
gr ividual. However, in order to test 
these models, it is necessary to use 
poured data. The use of grouped 
M^ implies a number of intersubject 
M i cp ind assumptions. In order to 
uc een homogeneity assumptions, 
SR Me selected for experimental 
bi cate theory were restricted to 
ae ergraduates with relatively 
wes ar academic interests and within a 
a Eis age range, 
18 assur 
that a inest fnenprobebility 
im ee chooses a stimulus as 
rent, for a given stimul 

ray and speaker response, is th E. 
for all listeners. Thus, the m er 
of listeners who, given pelas oe 
i, chose x from an x,y array rdum 
an x,y,z array was used to SEak 
pley) and Pi(x,y,z), respectively. 

is also assumed that the propor- 


tion of subjects who give i as a word 
associate to stimulus v in a standard 
word-association situation is an ade- 
quate estimate of Pz(i), the prob- 
ability that is sampled by a speaker 
during the sampling stage, given x as 
the referent. 

Word-association data also enter 
into the calculation of associative 
strengths for the speaker and the lis- 
tener. In the case of the listener, it 
is assumed that for any stimulus -, 


lx) = aEP&G) +c}, [6] 


where a, b, and ¢ are constants, and 
where P. (1) is estimated by the propor- 
tion of subjects who give response i to 
stimulus x in the word-association task. 
The same power function is also as- 
sumed to hold for the speaker's as- 
sociative strength s,(x), but possibly 
with different constants, that is, s(x) 
= A[P,(i) + C]?. 

The use of a power transformation 
is suggested by analogy of P,(i) with 
a physical continuum, that is, from the 
power law suggested by Stevens (1957) 
for psychophysical data. Whether or 
not the analogy is defensible, the ap- 
propriateness of the power transforma- 
tion for the present data is an empirical 
matter. This and other transforma- 
tions were examined and compared 
using listener data. The results of 
these comparisons are given in the 
section on Tests of the Listener Model. 


Metareferential Behavior 


The experimental paradigm and pos- 


sibly the theory can also be extended 
to the study of speaker and listener 
responses that are typically found in 


t necessary to assume 


that the probability with which an F 
vidual subject gives i as a word eens 
to stimulus x is the same as the P.(i) for 
that subject. We shall comment further on 
this issue below in Intersubject Homogeneity 
in Word Association. 


5 Note that it is no 
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human communication as aay to 
or consequences of the referential unc- 
tion. For example, a listener may 
reject a speaker's referential response 
as inadequate ("I don't understand."), 
solicit additional speaker behavior 
(“Tell me more about it."), or confirm 
a tentative guess about the referent by 
supplementing the speaker's response 
(Do you mean the red book?"). The 
relationship of these metareferential 
functions (or metalingual functions as 
they are termed by Jakobson, 1960, 
p. 375) to the referential function can 
be studied experimentally by suitable 
additions to the responses available to 
the listener. Evidence from a pre- 
liminary study suggests that the prob- 
ability that a listener rejects a speaker 
response as inadequate for the identi- 
fication of the referent is positively 
related to the similarity between the 
associative strengths of the speaker 


response to each of the two stimulus 
words. 


COLLECTION oF Worp-AssocraTION 
DATA 


Word-association data were collected from 
544 male undergraduates from two intro- 
ductory psychology sections and one intro- 
ductory music section at Rutgers University. 
Two hundred and seventy-two subjects were 
given one member from each of 153 synonym 
pairs as stimuli; the remaining 153 words 
were given to the 272 additional subjects. 
Subjects were tested in groups of 100-200 
students each. 

Each subject was given a deck of IBM 
cards consisting of instruction cards and 153 
cards with a different stimulus word printed 
on each card. The stimulus words given to 
each subject were in a different random 
order. 

Subjects were asked to print one and only 
one word on each card—“the first word that 
comes to mind” for each stimulus word. 

Each response was punched on the same 
IBM card that the subject used. Spelling 
errors were corrected. Response-frequency 
counts for each of the 306 stimulus words 
were obtained directly from the punched 
cards with the 7090 computer. 
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TESTS OF THE LISTENER MODEL 


The two-person situation, in which 
each listener is assembled with only 
one speaker, is an inefficient way to 
test the various assumptions and con- 
sequences of the listener model. From 
the standpoint of obtaining stable esti- 
mates of Pi{x,y} or Pi(s,y,z), the most 
desirable set of data is one in which a 
sizable number of listeners is given the 
same speaker response for any given 
stimulus array. Furthermore, when 
listener behavior is linked with word- 
association data, it is desirable to base 
parameter estimates (e.g., estimates of 
a, b, and c in Equation 6) on a large 
number of +,y,i triples which vary over 
a wide range of P,(i) and P,(i) values 
and combinations. In short, the fortu- 
itous set of speaker responses provided 
by each speaker to his listener does not 
have these desirable properties, 


Listener Behavior and Associative 
Strength 


Instead of a two-person situation, 
110 male undergraduates (not previ- 
ously used) were all given the listener 
task with the same set of 256 xyi 
triples. Each triple consisted of a 
synonym pair and a speaker response. 
The subjects were run in one of four 
groups of 25-30 persons, and each 
group was given the 256 word triples 
in a different random order, Several 
synonym pairs appeared more than 
once in the list but with different 
speaker responses. Each word triple 
was projected on a screen for 8 seconds. 

Subjects were told that speaker re- 
sponses had been obtained from college 
students during a previous semester 
and that their task as listeners was to 
identify which of the two Synonyms in 
each word pair had been the speaker's 
assigned referent. Listeners were ac- 
tually given speaker responses system- 
atically selected from among the word 
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associates of one or both members of 
eas pairs in order to obtain a 
b e range of P,(i) and P,(i) values. 
he relevance to the two-person situa- 
tion of listeners’ data obtained from 
this study depends on the extent to 
Which actual speaker responses to a 
referent are, in fact, the same as word 
Mri to that referent—as the 
pie ET model assumes. As we shall 
lass p^ — concerned with speak- 
Bi E rei — y and word as- 
rre a referent do overlap con- 
T y wwe. form of the model for 
Eaten er is as follows. If 
don 3. ad 3 substituted into Equa- 
— s the appropriate algebraic 
: ns are made, the following 


linear e 

r equation wi i i 
5 wi 

obtained: th zero intercept 1s 


log CPilx,y}/Pifyx} ] 
=b log {LP.(i)+cV/[P,(i)+e]}, [7] 


: mar =1—P,{x,y}. The 
Amps A of listeners who chose x was 
eee a - pecunie of P,(x,y) in Equa- 
s Mr oa: proportions of times 
Word associat vem desertae 
as atte Fa 4) and Po) rà 
qn in Mrd zu RU 
Nea ar least-squares procedure 
rr he following estimates: 
aie creda wane These estimates ° 
the notation ín “1 word triples with 
Synonym so one ‘Poy denm 
triple w. 3 pa a are 

as deleted because P,{x,y} = 


1.00. Fi t 
igure 2 is a plot of the data with 


the fitted line having asl * 
There is no indication s ope equal to b. 
curvilinearity in the s iil jon 
a : erved relation- 
i 5 Since a slight linear relati ation 
ke exist between values of the i d 
ariable and the variance of the Sarpedon 
Bas also estimated by the method values, b 
wes (Snedecor, 1946, pp. 105-108). "Tic 
resulting b was .69. b The 


onship appears 
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Fic. 2. The use of a single-valued func- 
tion of P.(i) and P,(i) derived from the 
comparison model to relate word-association 
data and listeners’ choice data. 


ship. These findings, which are in sup- 
port of the model, encouraged us to 
examine the listeners' data in further 


detail. 


It is important to ascer 
how well the model, with parameters b and 
c estimated from all the data, fits selected 
subsets of the data. The generality of the 
model is considerably diminished if the con- 
stants in the power function depend on the 
values of P.(1) and P,(i). Therefore, the 
data were first partitioned on the basis of 
P.(i) and P,(i) into the following four 
(a) P.G)20, PG) =0; 


tain, for example, 


subsets: * 
P.(i) >0, 0< P,G) < 012; (c) P.) 
> 012, .012« Py (i) < .030; (d) P:(%), 
Py (i) > .030. 

re then fitted 


Data from each subset we t 
separately to Equation 7 using the iterative 
least-squares procedure. A minimum sum 
of squared deviations was obtained for each 
subset. These minimum sums of square 
deviations decreased by .5976, 2.8496, .5676, 
and .95%, respectively, from the four sums 
of squared deviations based on b= .700 and 
£—.0018. (An increment is not possible.) 
The average decrease is not statistically 
significant. Thus the parameters, 
do not appear to depend strongly on the 
combinations of P«(1) and Py i 

Let us now rur twa s 

impli ions) of the mode? 
Cae e case where b is assumed 
to be 1; that is, a simple linear function is 
used in place of the general power function 
to estimate associative strength from ae 
ciative probability. o test the adequacy E: 
this special case, e data from all 255 wori 
triples were fitted to Equation 7 by means © 
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a least-squares procedure. _ The D 
inimum sum of squared deviations is 8.7876 
ie than the minimum obtained from the 
€ 6,2) model. This increase 
meter (b,é) mode 
Hahn cen beyond the .001 level. [ees 
a rule out a simple linear 
ace oll og guum associative probability 
iative strength. 
uc simplification of the general 
model is to delete c from the power function, 
that is, set c — 0. This Simplification seems 
articularly reasonable since ĉ is so close to 
ee Note, however, that when P.(i) or 
P,(i) equal zero, then (x) or L(y) equal 
zero (see Equation 6) so that Equation 3 
does not satisfy the restriction that P.{x,y} 
0,1. If this restriction Were removed from 
Equation 3, then the predicted Py) is 1 
when P,(i) — 0 for all values of P.(i) > 0. 
However, in the data Pr{x,y} is a monotoni- 
cally increasing function of P.(i) when 
Py(i) =0 (Rosenberg & Cohen, 1964)—a 
result that is in accord with the model if 
c>0. 

Before accepting the use of the power 
function, one might wish to investigate a rea- 
sonable alternative. It may be argued, for 
example, that the probability that a listener 
chooses x as the referent in an x,y pair is 
2 monotonic function F of the difference 
between the associative probabilities P.(i) 
and P,(i), that is, 


Pina) m ho) / o) +h (y)] 
—F[P.G) —P,(i)]. [8] 
It has been shown (Adams & Messick, 
1957) that Equation 8 implies that an ex. 


ponential function relates associative proba- 
bility to associative strength, that is, 


L(x) =h exp [RP.(1)], [9] 


where h and k are constants, 
Equation 9 into the Bradley-Terry model 
(Equation 3) and making the appropriate 
transformations yields the following one- 
parameter linear equation: 


log [Pilx,y}/Pily,x}] 
=h[P-(i) — P,(i)]. [10] 


Substituting 


The sum of squared deviations from a 
linear least-squares fit using Equation 10 
was almost double (92% greater) that of 
the sum of squared deviations involving the 
power function. Unfortunately, there is no 
satisfactory statistical test for comparing 
these two sums of squared deviations. 
Nevertheless, the sizable difference in 


squared deviations and the fact that it is 
based on a relatively large number of ob- 
servations warrant a preference for the 
power transformation. 


Test of the Choice Axiom? 


A group of 236 male undergraduates 
(not previously used) were each given 
48 sets of three stimuli along with a 
speaker response for each stimulus 
triple. The same subjects were also 
given the three possible stimulus pairs 
formed from a set of three stimulus 
words along with the same Speaker 
response. Half the subjects completed 
the three-choice part first, and the other 
half completed the two-choice part first. 

The following example from this 
study may help to clarify the empirical 
test of the choice axiom. Let 4, y, and 
Z be the stimulus words LIBERTY, LAW- 
FUL, and FREEDOM, respectively; the 
speaker response i is equality. From 
the presentation of all three stimuli, 
the following Proportions were obtained 
for choices of v, y, and z, respectively : 
.538, .103, .359, Using Equation 5 it 
would be predicted, for example, that 
when only x and y are given with the 
Speaker response, P, {2,9} = .538/(.538 
+ .103) = .839. hen only x and y 
Were presented with the Speaker re- 
Sponse equality, the obtained estimate 
of Pi(x,y) was found to be .834. Simi- 
larly, predictions can be made from 
the three-choice data to the proportions 
of choices of y from y, 2 and of x 
from y, z. 

Figure 3 summ 
tained from the 
There appears to 
deviation of the set 
diagonal line. A least-squares fit of the 
points yielded a regression coefficient 
of .94. The correlation coefficient be- 
tween predicted and observed values 
was .97. In short, these data strongly 


T Data for 
Austin Ely, 


arizes the results ob- 
entire set of data. 
be no systematic 
of points from the 


this section were collected by 
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Na 3. Scatter diagrams of observed pro- 
ae a of referent choices in two-choice 
tained Lnd the predicted proportions ob- 
Eos 7 Luce’s choice axiom from the 
(Each. proportions in three-choice arrays. 
fidis int represents the data from an 

Set of three stimulus words and 
a Speaker response.) 


Sapport the proposition that listeners’ 
oices conform to the axiom. 

d AE e have already noted, estimates 
ih thts PRE strength are not required 
PAE Fh of the model. Thus, it is 
Dawes O extend this form of the 
ros lié po to stimulus materials 
is ca complex nonverbal stimu- 
soditi "g^ to speaker responses 
eive a of phrases, sentences, and 

rbal communications. 


Tr: 
STS OF THE SPEAKER MODEL 


This i i 
Ps d ion describes a number of 
nless it MA is n koc. Mesi 
pirical anal ra Vus gere d o 
data ollecte T o 25 Mero 
graduate. nox Eu m uu 
pu sS (not Previously used I 
ject Was given a deck of i eis 
consisting of instruction card. phas 
pone cards. A word vu > 
ferent underlined was printed A 
sumulas card. Forty word oes iis 
ynonyms, and 10 were unrelated usd 
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pairs. The subjects were all given the 
same 50 word pairs, but in different 
random orders. They were instructed 
to print a one-word speaker response 
on each card before going on to the 
next card. Subjects were told that 
their responses along with the word 
pairs would be shown to listeners at a 
later date and that their "scores" would 
depend on their listeners’ accuracy in 
identifying the referents. 

Results from two other speaker 
studies are also cited briefly in this sec- 
tion. These studies will be described 
when they are used. 

Consequences of the following funda- 
mental assumptions were investigated : 
(a) Intersubject homogeneity—Given 
any referent r, all subjects sample from 
the same response set Ir; (b) W ord as- 
sociates and. speaker responses—Word 
associates of the referent stimulus pro- 
vide an empirical identification of the 
responses in I,; (c) Comparison stage 
—The comparison stage depends on 
the relative associative strengths, sir) 
and s(n), of a sampled response i to 
each of the stimuli, r and n. 

Empirical consequences derived from 
the first two assumptions are param- 
eter free and are equally valid for the 
stationary and nonstationary models 
discussed in the Theory section. The 
validity of these first two assumptions 
obviously depends on the way sub- 
jects are selected. i 

Analyses concerned with assumption 

(c), that is, the nature of the compar 
son stage, depend on the assumption 
that word-association norms provide a 


satisfactory estimate of the probability 
that 7 is sampled by the speaker in the 
sampling stage. Generally speaking, 


the sampling and comparison stages 
are sufficiently intertwined so that it is 
difficult to study details of one stage 
without making explicit assumptions 


about the other. 
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Intersubject Homogeneity 


The ideal way to determine the simi- 
larity among response sets I r of a group 
of speakers is first to identify the re- 
sponses in each speaker’s I. One 
method of identifying these responses 
is to require a speaker to emit a num- 
ber of responses to each referent, a 
method analogous to continued associa- 
tions. A related method is to inter- 
sperse presentations of the same refer- 
ent with time periods filled with other 
activities so as to reduce sequential 
dependences among the speaker re- 
sponses to the referent. It is ques- 
tionable whether either of these methods 
yields an adequate sample of the re- 
sponses in an individual's 7,. 

An alternative, albeit less direct, test 
of the homogeneity assumption is as 
follows. If all subjects sample ran- 

domly from the same I, (and then go 
through the same comparison stage), 
then the preference for a specific kind 
of speaker response, for example, syn- 
onym, antonym, common responses, 
etc., is not expected to be à consistent 
Property of an individual across items 
(as it often is in the word-association 
task). Of these various idiosyncratic 
preferences, the preference for common 
responses would appear to be one least 
disturbed or suppressed by the com- 
munication task. That is, an experi- 
mental test involving the speaker’s use 
of common responses would appear to 
be the strongest test of the homogeneity 
assumption. 

Two measures of communality were 
calculated for each Speaker and for 
each of two nonoverlapping sets of 20 
synonym pairs. These measures are: 
(a) the number of popular responses 
given by a speaker where a popular re- 

sponse is defined for each synonym 
pair as the most frequently occurring 
Speaker response for that Synonym 
pair, (b) the number of unique re- 
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sponses where a unique response is any 
speaker response to a synonym pair not 
given by any other speaker. The split- 
half reliability coefficients, based on the 
272 speakers, were .041 and .042 (t 
>.50) for the popular and unique 
scores, respectively. . 
It is possible that the null hypothesis 
was not rejected because the individual- 
difference measures were based on too 
few word pairs. To check on this pos- 
sibility, split-half reliability coefficients 
were also calculated from word-asso- 
ciation data using the same number of 
items (20-20) and subjects (272), 
Word-association data from the same 
stimulus words as those in the speaker 
study were used. Split-half reliability 
coefficients based on the word-associa- 
tion data were .449 (p< 001) and 
570 for popular and unique response 
Scores, respectively, The sizable dif- 
ference between these correlations and 
those based on the Speakers' data sug- 


gest that a sufficient number of items 
was used to detect j 


ences in commun 
if these differenc 


These 
selected from the S 


who had been in th 
Study described in 
Their communality 
association test we 
for Selection; h 
subjects had hig! 


ample of subjects 
€ word-association 
à previous section, 
Scores on the word- 
Te used as a basis 
alf of the ‘selected 
c h-communality scores 
(first quartile), and the other half had 
low-communality Scores (fourth quar- 
tile). They were given the speaker 
task 6 months after their participation 
in the word-association study. The 
speaker-response frequencies from each 
of the two Broups were compared. Re- 
Sponse types given to each referent- 


S 


)" 


nonreferent pair were grouped so as to 
yield an expected frequency of at least 
5 in each cell of a groups (2) X re- 
sponse class matrix. A chi-square for 
each of the 25 referent-nonreferent 
pairs was then computed from this 
matrix of response frequencies. The 
sum of the 25 x? values was 60.1 (p 
< .30, df = 56). Evidence from other 
work (Jenkins, 1960, p. 317) appears 
to rule out the possibility that statistical 
regression in word-association com- 
munality scores accounts for the ab- 
sence of a difference between the two 
groups. The present result is in ac- 
cord with the homogeneity assumption 
for the speaker task. It also raises the 
interesting question of why subjects 
who were consistently different in the 
word-association task were not also 
different in the speaker task. This 
question will be discussed in a later 
section on Intersubject Homogeneity 


in Word Association. 


Word Associates and Speaker Re- 
sponses 


According to the theory—assump- 
tion b above—it would be expected 
that speaker responses to a referent are 
a subset of the word associates of that 
referent regardless of the nonreferent, 
of the details of the comparison stage, 
or of any changes in sampling proba- 
bility that may occur when one or more 
sampled responses are rejected. How- 
ever, a sample of word associates of a 
referent stimulus may not contain all of 
the obtained speaker responses to that 
referent even though the sampling as- 
sumptions are correct. This is evident 
from the fact that the pr i f 
speaker respo: ee 
ve di ie mses that also occur as 
sample diet AVE Upeo RESCUE 

Therefore, it was necessary to de- 
termine the proportion of speaker re- 
sponses that are likely to be present in 
a sample of word associates of a given 
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size, assuming the theory is correct. 
This proportion was estimated empiri- 
cally by using the proportion of word 
associates of r that are found at least 
once in an independent sample of word 
associates of r. 

Data from 11 stimulus words were 
used in the analysis. These words 
were selected because they were on the 
Kent-Rosanoff list and, hence, were 
words with extensive normative data 
available from other investigators. 
Also they were referents in pairs of 
synonym words in the speaker study. 
For each stimulus word, we deter- 
mined (a) the proportion of a sam- 
ple of 272 word associates in our 
own word-association study which were 
found in samples of word associates 
collected by other investigators, (b) 
the proportion of a sample of 272 
speaker responses present in the word- 
association norms of other investiga- 
tors. 

Figure 4 summarizes the results. 
The curves are necessarily monotonic 
since they show the average proportion 
of samples of words (word associates 
or speaker responses) that were found 
in cumulative samples of word associ- 
ates. That is, the leftmost point on 
any curve is the average proportion of 
responses found in a sample of 32 word 
associates given by Rutgers male under- 
graduates ; the second point is the aver- 
age proportion found in the Rutgers 
sample of 32 plus a sample of 500 Min- 
nesota male undergraduates, etc. 

The bottom curve i Figure 4 was 
obtained by randomly pairing each of 
the stimulus words with another stimu- 

8 Word associates alien aere, cdd Ens 
lated the MEUS Tait word, were deleted. 
example, the nones he number of 


i word, t 
For each stimulus n b 
speaker responses used in the analysis was 


d associates 
the number of word | 

Be ined is randomly selecting this number 
E: perte responses from the total speaker 


sample of 272 responses. 
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Fic. 4. The average proportion of word 
associates or speaker responses (ordinate) 
found in independent samples of word- 
association data (abscissa). (Based on 11 
stimulus words from the Kent-Rosanoff list. 
Sources of word-association data are shown 
along the top of the graph—Rutgers males: 
Rothberg, 1965; Minnesota males: Palermo 
& Jenkins, 1964; Brooklyn males and fe- 
males: Unpublished norms collected by S. J. 
Segal, 1960; Minnesota males and females: 
Russell & Jenkins, 1954; Minnesota females: 
Palermo & Jenkins, 1964.) 


lus word from the set of 11, for ex- 
ample, ANGER, BEAUTIFUL, and count- 
ing the number of speaker responses to 
one (say ANGER) found in the word 
associates to the other (say BEAUTI- 
FUL). This is essentially a check on 
the possibility that a substantial pro- 
portion of speaker responses can be 
found as word associates to any stimu- 
lus word if the sample of word associ- 
ates is large enough. Apparently this 
is not true. 

By contrast with the bottom curve, 
the two top curves are close together 
and may be converging. Furthermore, 
those speaker responses not found 
among the largest sample of word as- 
sociates consisted almost exclusively of 
speaker responses that occurred once 
only That is, the high-frequency 
speaker responses almost invariably 
showed up as word associates of the 
referent. It was concluded from 
these findings that, except for an in- 
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consequential number of the J, re- 
sponses with the smallest P,(z) values, 
there are no responses in 7, which can- 
not be identified independently in a 
sizable sample of word associates. 


Comparison Stage 


It is difficult to verify, in detail, the 
assumptions concerning the comparison 
stage. This difficulty exists primarily 
because the sampling and comparison 
stages are so intertwined and because 
the sampling probability P,(i) “domi- 
nates” the behavior effects of the refer- 
ential process. 

To begin with we will demon- 
strate only the gross effects of the 
comparison stage. This will be done 
by first constructing a model consisting 
only of a sampling stage so that the 
nonreferent is assumed to have no ef- 
fect on the emission of a speaker re- 
sponse. In such a one-stage model, 
the speaker response is essentially a 
random selection from the distribution 
of responses in Z, It will then be 
shown that this one-stage model is 
inadequate and that these inadequacies 
suggest the existence of a comparison 
stage whose outcome depends on as- 
sociative strengths s;(r) and s(n). 

Fitting a one-stage model to data is 
relatively straightforward. However, 
the unconditional probability, 5; that 
the speaker emits i is not necessarily 
equal to P,(i) estimated from word-as- 
sociation norms since there are re- 
sponses in the word-association norms 
that the speaker does not emit because 
he would violate the instructions not to 
give a response "that sounds like or is 
spelled like" either of the stimulus 
words. For these words it is assumed 
that P,(i) — 0. For the remaining 
words, the relationship between pi and 
P,(i) in the one-stage model is, 


bi = PL) Y. P,(j). 


ely 


[11] 
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Equation 11 is essentially a special case 
of Equation 2. That is, the assumption 
that a response that is sampled is in- 
variably emitted is equivalent to setting 
P.(i|i) and P.(j|j) equal to 1 in Equa- 
tion 2. In effect, the associative prob- 
ability of a word that violates the in- 
structions to the speaker is redistrib- 
uted among the legitimate words in 
proportion to the P,(i) values of the 
latter. "According to Equation 11 the 
relationship between p; and P,(i) is 
linear. 

The one-stage model summarized by 
Equation 11, in combination with word- 
association data, was used to predict 
the distribution of speaker responses 
from synonym pairs. Data from un- 
related word pairs were treated sepa- 
rately for reasons which will be given 
later. 

Figure 5 summarizes the relationship 
between the observed average propor- 
tion of speaker responses to synonym 
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Fic. 5. Average observed 

speaker responses versus [Eee iib 
predicted from sampling (one-stage) model 
(The diagonal is the set of points in which 
predicted frequency equals observed fre- 
quency. The solid curve is based on data 
from 40 synonym pairs—present speaker 
study. The dashed curve is based on data 
from 25 of these synonym pairs obtained in 
a previous study—Rosenberg & Cohen, 
1964.) 
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GIVEN TO REFERENT [Pr (i)] 


Fic. 6. The relation between the propor- 
tion of times that a response word occurred 
as a word associate of the referent [P.(1)] 
and the average proportion of times that the 
same response word occurred as a word 
associate of the nonreferent [Pa(i)]. (Based 
on 25 synonym pairs.) 


AVG. ASSOCIATIVE PROBABILITY 
OF SAME WORDS GIVEN TO 


pairs and that predicted from word- 
association data using the one-stage 
model. Data from two speaker studies 
are plotted separately. One curve was 
obtained from an independent sample 
of 108 subjects used in a previous 
study (Rosenberg & Cohen, 1964) ; the 
second curve was obtained from the 
speaker study described in the present 
paper. Response types with similar 
predicted values were grouped to ob- 
tain the average values plotted in the 
figure. 

If the sampling model were com- 
pletely adequate, all data points would 
fall along the diagonal in the graph 
or, at least, would fall on a straight 
line. In both studies, a systematic de- 
viation from this prediction occurs for 
the high-probability speaker responses 
where the observed frequency of speaker 
responses is considerably smaller than 
the predicted frequency. The question 
is how the comparison stage which was 
omitted for this analysis can account 
for this deviation from prediction. 

A possible answer to this question 
was found by examining the asso- 
ciative probabilities to referent and 
nonreferent of these high-probability 
speaker responses. Figure 6 summar- 
izes this relationship for 25 of the 
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airs used in both speaker 
ee The graph (which does not 
contain any speaker data) shows that 
the probability that a word associate 
of the referent will also be given as a 
word associate to the nonreferent is 
proportionately greater for high- than 
for low-probability associates of the 
referent. The effects of a comparison 
stage would thus be particularly notice- 
able for the high-probability word as- 
sociates since their associative strengths 
to referent and nonreferent are both 
high. That is, the speaker will tend to 
reject the high-probability responses in 
I, because of their similarity in associa- 
tive strength to responses in the non- 
referent set; low-probability responses, 
on the other hand, will be more likely 
to be emitted when they are sampled. 
The same effect was observed in a 
study by Krauss and Weinheimer ® 
on the communication of colors. They 
demonstrated that when the hue of 
the nonreferent colors was similar to 
that of the referent, speakers used one 
of the less common (low-frequency) 
names to communicate the referent 
color, In dissimilar stimulus arrays 
the agreement among speakers in the 
names they used to refer to a color 
was extremely high. As we shall see, 
the latter finding corresponds to the 
behavior of our speakers when they are 
given unrelated word pairs. 
For unrelated word pairs, the associ- 
ative probability of any sampled re- 
sponse to the nonreferent is, for all 
practical purposes, zero. That is, a 
plot comparable to Figure 6 for an un- 
related word pair would generally be a 
horizontal straight line with a close- 
to-zero intercept. Also, there are very 
few (if any) word associates of the 
referent that violate the instructions to 
the speaker. According to a sampling 
model, therefore, when the referent and 


9 Unpublished manuscript, 1965. 


nonreferent are unrelated, the distribu- 
tion of speaker responses to a referent 
is expected to be essentially the same 
as that of the word associates to that 
referent. 

However, according to the two-stage 
speaker model (sampling and compari- 
son) the probability that the word 7 is a 
speaker response is greater than (less 
than) its probability, P.(i), as a word 
associate for high-probability ( low-prob- 
ability) word associates. An intuitive 
argument for this proposition is as fol- 
lows. First, for unrelated word pairs 
the lower the associative strength of a 
sampled word to the referent, the closer 
this value will be to the associative 
strength of the sampled word to the 
nonreferent; the latter being the mini- 
mum possible value of associative 
strength. Hence, the speaker will 
probably judge remote associates as in- 
adequate to communicate the referent 
to his listener. Second, in contrast to 
Synonym nonreferents, unrelated non- 
referents are not likely to have any 
high-probability associates in common 
With the referent. Hence, whenever a 
high-probability associate is sampled, 
the speaker will probably judge it as 
an adequate communication. The net 
result of these Processes in the case of 
unrelated nonreferents, therefore, is to 
constrain the speaker towards clue 
words of higher communality than he 
might emit in standard free word as- 
sociation. 

The formal proof of the above prop- 
osition is as follows. For unrelated 
word pairs we assume that the associa- 
tive probability of a speaker response i 
to the nonreferent, P,(i), equals zero 
for all iel, Thus, the associative 
strength s;(n)= A[P,(i) + C]?=AK 
where A,B,C, and K are constants 
each > 0. From Equation 1 we obtain 


the conditional probability of emitting 
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sampled response i: 


[P.G) CP «X 


Equation 12 implies that P,(i|i) is a 
monotonically increasing function of 
P,(i). We now rewrite Equation 2 in 
the following form: 


P.) = 


Bi oon PN 
P, (i) ` P(A)Pe(Gl i) 


Since the sum in the denominator of 
the right-hand expression is a weighted 
average of all possible P, (ili) values, 
the equation is equal to a quantity less 
than 1 when P,(i/i) is at a minimum 
and to a quantity greater than 1 when 
Pe (ili) is at a maximum. In particu- 
lar, [^|P,(i)] < 1, that is, pi < P,(i), 
when P,(ili) is at a minimum. Fur- 
thermore, since P,(i|i) and P,(i) are 
monotonically related (Equation 12), 
P.) is also at a minimum when 
Pe(ili) is at a minimum so that P, 
< P,(i) when P,(i) is at a minimum. 
By a similar argument it is found that 
ti > P,(i) when P,(i) is at a maxi- 
mum. 
. The empirical results are summarized 
in Figure 7. These results reveal that 
speaker-response types (dashed curve) 
occur with greater frequency than their 
frequency as word associates (solid 
curve) for the most common word as- 
cd of the referent stimulus; the 
1s true for the less common 
word Associates. This observed rela- 
tionship between the two curves is 
that predicted by the two-stage model.!'? 
10 A possible contaminating effect that was 
observed among the speaker responses was 
a heightened occurrence of synonyms as 
speaker responses when the two stimulus 
words were unrelated. This “synonym-re- 
sponse effect” may exaggerate the effect 
predicted from the two-stage model if syno- 
nyms are found more frequently among 
high- than among low-probability word as- 
sociates. When the frequency of speaker 
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shows the average rank-proportion relation- 
ship for responses given to 10 stimulus 


words. (Curve labeled “Speaker-Responses” 
shows the average rank-proportion relation- 
ship for the same responses given as speaker 
responses to the same stimulus words; the 
nonreferent in all stimulus word pairs was 
unrelated to the referent.) , 


By contrast, a one-stage model predicts 
that the two curves will coincide at all 
points. The x? value comparing the 
frequencies of response types given as 
word associates and as speaker re- 
sponses was 155.2 (p < .00001, df = 
20). Thus, the speaker data from un- 
related word pairs provide additional 
and somewhat different evidence for 
a comparison stage whose outcome de- 
pends on the associative strengths s;(7) 
and s; (11). 

The data presented in Figure 7 also 
support the fundamental proposition 
that P,(i) can be estimated from the 
associative probability of i to r. From 
this proposition we would predict a 
strong monotonically increasing rela- 
tionship between p; and P,(i), provid- 
responses (in a related type of communica- 
tion task) was adjusted according to the 
correlation between frequency and synonym- 
ity, the effect predicted by the model was 
still highly significant (Rothberg, 1965). 


The synonymity between a stimulus word 
and its response was based on judges 


ratings. 
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ing P.(i|i) and P,(i) are either un- 
correlated or are positively correlated. 
This monotonic relationship was also 
found in the data from synonym pairs 
although it is difficult to derive from 
the model for synonym pairs since the 
correlation between P,(i|i) and P,(i) 
is highly variable. 


Goodness of Fit of the Two-Stage 
Model 


The only serious obstacle to fitting the 
stationary Markov model (Equation 2) to 
the data is the fact that the parameters of the 
power function (relating associative strength 
to associative probability) are unknown. 
These unknown parameters are B and C in 
the function s(() = A[Px(i) + C]2. (The 
parameter 4 cancels when the function is 
substituted into the Bradley-Terry model.) 

One approach to the problem of estimating 
B and C is to assume that they are equal to 
the b and c, respectively, which enter into 
listener's comparison (Equation 6)—b and é 
would then be used as estimates of B and C. 
This is an attractive approach to the esti- 
mation problem since these parameter esti- 
mates are completely independent of the 
speakers’ data, 

Letting Ê — 700 and € = .0018 in the 
two-stage stationary model, we  calcu- 
lated a set of predicted 5, values for the 
speaker responses to each word pair. The 
chi-square values were then computed from 
the observed pı values and the f, values pre- 
dicted from the model. The chi-square 
values were found to be of approximately 
the same size as those obtained from a com- 
parison of obtained and predicted 5, values 
from the sampling (one-stage) model. In 
short, the parameter estimates obtained from 
the listeners’ data are inadequate for use in 
the comparison stage of the speaker. 

A second approach to the problem of esti- 
mating B and C is the use of an iterative 
least-squares procedure involving speakers’ 
and word-association data. The sum: 


P3 (b; — pi)? 


was minimized with respect to B and C 
using an algorithm described by Marquardt 
(1963). This procedure did not yield esti- 
mates of B and C that resulted in any sig- 
nificant reduction of the chi-square values 
from those based on the parameters of listen- 


er's comparison. Since the properties of 
this iterative least-squares procedure (in- 
volving as it does a highly nonlinear mode!) 
are not well understood, we hesitate to 
conclude, without further research, that 
the comparison model used is inappropriate 
for the speaker. It may be the case, of 
course, that the Bradley-Terry model in 
combination with the power function is not 
appropriate for the speaker's comparison 
stage. For example, for sampled responses 
where P,(i) =P,(i), it may be the case 
that si(r) > se(n) ; that is, the conditional 
probability of emitting i may be greater 
than .5. That is, the speaker's decision to 
emit a response may be biased by the fact 
that he knows the referent. This type of 
bias cannot be accounted for by any com- 
bination of values of B and C. A possible 
answer to this question may be furnished by 
a study in which the sampling of a response 
is done for the speaker by the experimenter, 
and the speaker has the choice of using the 
response as a speaker response or rejecting 
it as inadequate. 


INTERSUBJECT HOMOGENEITY IN 
Worp ASSOCIATION 


Because of the intimate relationship 
between 7, and word-association norms, 
it is natural to extend, if Possible, the 
assumption of intersubject homogeneity 
to the word-association situation. One 
of the immediate consequences of the 
intersubject homogeneity assumption in 
the word-association situation is that 
the response Protocol (one response to 
each of a set of stimulus words) for any 
individual subject will contain as many 
popular and unique responses as that of 
any other randomly selected Subject, 
The facts, however, clearly Contradict 
this proposition. Highly reliable dif- 
ferences among subjects in their 
communality scores have been demon- 
strated in the results of other investi- 
gators (Fosmire & Tryk, 1963; Jen- 
kins, 1960) as well as in our own word- 
association data. (In our data the 
split-half reliability coefficient based on 
responses to 152 stimulus words was 
.82.) In other words, different sub- 
jects contribute disproportionately to 
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different parts of an average rank-fre- 
quency distribution of word associates. 

How can these facts be reconciled 
with the findings that group word- 
association norms do provide satis- 
factory estimates of J, and that inter- 
subject homogeneity in the speaker 
task is a tenable asssumption? It may 
be the case that underlying response 
sets in the word-association situation 
vary from subject to subject, but group 
norms somehow estimate J, accurately, 
and these J, sets do not differ from 
speaker to speaker. This ad-hoc as- 
sumption is clearly not as attractive as 
the assumption of intersubject ho- 
mogeneity in both word-association and 
speaker tasks. In addition, the ques- 
tion of intersubject homogeneity in 
word association also arises in con- 
nection with the use of associative 
strength and associative overlap based 
on word-association norms in a large 
variety of studies of verbal learning 
= (Cofer, 1961; Deese, 

The highly reliable individual dif- 
ferences in communality scores obtained 
Írom word-association responses can, 
however, be reconciled with a ho- 
mogeneity assumption by postulating 
a two-stage process in the word-as- 
sociation task. According to this con- 
ception, the subject in the word-as- 
sociation task first samples a response 
from a hypothetical set of responses 
and then compares certain properties 
of this sampled response with criteria 
supplied by the experimental instruc- 
tions (as in controlled word association) 
or supplied by the subject himself (as 
in free word association). The subject 
either emits the sampled response or 
rejects it depending on how closely the 
sampled response approximates the as- 
signed or self-instructed criteria. With 
such a two-stage model the assumption 
can be retained that all subjects from a 
judiciously chosen population sample 


from substantially the same response 
set, but that the subjects may vary in 
their self-instructed criteria. 

A number of investigators have 
noted the existence of several self- 
instructed criteria (of which the sub- 
ject may not be aware). In an exten- 
sive study of this question, Moran, 
Mefferd, and Kimble (1964) used a 
factor-analytic technique and identified 
three distinct self-instructed criteria 
which they term “idiodynamic sets." 
Moran et al. also present evidence to 
the effect that when the popular re- 
sponses to a stimulus word are con- 
gruent with the subject’s idiodynamic 
set, the subject contributes dispropor- 
tionately to the frequency of these re- 
sponses and makes fewer “mediation 
faults,” for example, giving multiword 
responses, failing to respond within the 
time limit, giving remote associates 
such as box to Farru, etc. These find- 
ings are consistent with a two-stage 
model for word-association behavior in 
which the idiodynamic set is operative 
during the comparison stage only. 
From the results of the speaker studies 
reported in the previous section, it is 
concluded that these idiodynamic sets 
are suppressed in the communication 
task in favor of a uniform communica- 
tion set. 

If the two-stage model does hold for 
free word association, however, there 
is another dilemma. How can free- 
word-association norms be used to esti- 
mate sampling probabilities from J; if 
these norms have been biased by a 
variety of subject-controlled criteria 
during comparison in free word associa- 
tion? One possibility is that the effect 
of the sampling stage in word associa- 
tion is strong enough to prevent these 
biases (in the comparison stage) from 
seriously distorting the sampling prob- 
ability of most responses in the under- 
lying set of associates. The results of 
our experimental analyses concerning 
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the sampling stage in the speaker's 
referential process suggested that this 
stage is the dominant one. An alter- 
native hypothesis is that these idio- 
dynamic sets “cancel” each other so 
that group norms yield adequate esti- 
mates of the sampling probabilities. Of 
course, these hypotheses require fur- 
ther empirical study. For the moment, 
the justification for using free-word- 
association norms in tests of the speaker 
model is essentially the same as it is 
in studies of verbal learning and media- 
tion—an heuristic one. 


RECOGNITION AND RECALL PROCESSES 


The essential operational difference 
between recall and recognition is that 
in recall the subject is required to 
reproduce the items on a previously 
presented training list, while in recogni- 
tion the subject is required only to 
judge whether items, produced for him 
by the experimenter, were on the 
training list. It is suggested that 
performance on recall tasks is deter- 
mined by a two-stage process formally 
similar to the present speaker model, 
and recognition by a one-stage process 
similar to the present listener model. 

According to this conception, in re- 
calling an item, a subject first samples 
an item from an underlying set that 
consists of a list and an extralist 
subset. He then goes through a com- 
parison stage that determines the prob- 
ability that the sampled item will be 
emitted. In the comparison stage of 
the recall process, the subject, who 
must decide whether or not the sampled 
item is in the list subset, may do this 

by comparing the associative strength 
of his sampled response with a con- 
stant representing a threshold value of 
associative strength. That is, the prob- 
ability that an item will be emitted, 
given that it has been sampled, equals 
ti/(ti +k), where f; is the subjective 
probability (associative strength) that 


the sampled item is a list member, and 
k is some minimal value of associative 
strength below which the subject will 
not accept the item as a list member. 
In a recognition test, only the com- 
parison stage is entailed. 

The model of recall learning recently 
proposed by Kintsch and Morris 
(1965) is related to the two-stage 
model described in the present paper. 
They hypothesize that -recall learning 
is composed of a “recognition stage and 
a 2nd stage where the response be- 
comes available in the absence of the 
stimulus [p. 200].” In effect, in learn- 
ing there is a change in sampling prob- 
ability of an item and a change in the 
subjective associative strength of the 
item above the threshold value b. Thus 
recognition may be superior to ¢ecall 
for at least two reasons: (a) In recog- 
nition the sampling stage is eliminated 
as a variable affecting performance ; 
(b) the probability of correctly recog- 
nizing an item as a list member in- 
creases faster than its sampling prob- 
ability. The latter effect corresponds 
to a finding observed in our data: A 
Speaker response often exists which 
would result in a low proportion of 
listener errors, but is rarely given by 
speakers because it is a remote as- 
Sociate to the referent; that is, it has 
a low sampling probability. 

The advantages of recognition over 
recall can be considerably reduced by 
manipulations that raise the sampling 
probabilities of list members and pro- 
portionately lower those of the extra- 
list members of the underlying set, 
for example, overlearning or the use of 
contextual cues. This could make the 
recall task functionally equivalent to a 
recognition task with respect to item 
availability. A study by Lachman and 
Field (1965), for example, showed that 
recall scores did reach, and even sur- 
pass, recognition scores when the list 
of items was presented a sufficient 
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number of times prior to the reten- 
The actual superiority at- 
tained by the recall scores in Lachman 
and Field's (1965, p. 25) study, after 
16 or more training trials, can be at- 
tributed to the fact that the training list 
was presented in a serial order that 
had a "high approximation to English 
(i.e, prose)." On the recall tests, this 
had the effect of further raising not 
only the sampling probabilities but 
also the associative strengths of the 
items, since the subjects were free to 
use the sequential dependences among 
the items as contextual cues during 
recall. On the recognition tests, how- 
ever, subjects were given list and non- 
list items in random order. Hence, 
the sequential dependences among the 
items could not provide the same in- 
crements in associative strength on 
the recognition as on the recall tests, 
since the items were always sampled 
for the subject out of context. 

In multiple-choice recognition tests 
of the type analyzed by Murdock 
(1963), the present listener model ap- 
pears to be directly applicable. The 
subject is in essentially the same situa- 
tion as the listener in the word-com- 
munication task; he must choose a 
correct item from two or more alterna- 
tives. The listener-comparison model 
is appropriate to the job of predicting 
the proportion of correct choices in such 
situations provided that information 
is available concerning the associative 
strengths of the individual alternatives. 

Data relevant to this proposition 
were obtained in a probabilistic paired- 
associate study by Donner (1965) in 
which retention was measured with a 
two-alternative recognition test. In 
the training phase, Donner showed sub- 
jects (N = 100) a series of 160 letter- 
number pairings flashed on a projection 
screen at a ./ second/.7 second on-off 
rate, Each pairing consisted of one of 
four letters (F, M, P, W) and one of 


four digits (3, 6, 8, 9). The subjects 
were told to "learn the various com- 
binations that appear. The same letter 
may appear with a different number 
[p. 3]." The purpose of this pro- 
cedure was to generate a set of associa- 
tive probabilities of each of the re- 
sponse terms (the digits) to each of 
the stimulus terms (the letters), and 
then to use these probabilities as the 
word-association normative probabili- 
ties were used in the earlier listener 
study. 

In the recognition test, the subjects 
were given a series of triples. Each 
triple consisted of two of the four 
stimulus letters and one of the four 
response digits. Subjects were re- 
quired to choose the letter that “best 
goes with the number on the basis of 
what you have learned [p. 9]." The 
triples used in these tests were selected 
to permit assessment of subjects’ 
choices over a wide range of differences 
in the associative probabilities of the 
response term to each of the stimulus 
terms. 

The results were fitted to the listener 
model (Figure 8) using as estimates 
of P,(i) and P,(i) the proportion of 
times a response i was paired with the 
stimuli x and y, respectively, during the 


4 


w 


LOGe [Pi {x,y} /P1 {y.x}] 


D 2 3 4 5 $ 
LOGo {[Px(t) + 0.0050] / [eytt)+0.0080)} 


Fic. 8. The use of a single-valued func- 
tion of P.(i) and P,(G) derived from the 
listener-comparison model to relate induced 
associative probability and choice data in the 
recognition task. (Each plotted point repre- 
sents data from an individual +,y,i triple.) 
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training phase. The proportion of sub- 
jects choosing x in the recognition test 
was used as an estimate of P;{x,y}. 
The parameters b and c in the model 
were estimated by the iterative least- 
squares procedure used in the listener 
study. The slope b of the fitted line in 
Figure 8 is .623. These parameter 
estimates probably depend, in part, on 
the number of training trials, but are 
similar in order of magnitude to those 
found in the listener study with word- 
association data. As in the listener 
study, special cases of the power func- 
tion, that is, b=1 or c= 0, resulted 
in considerably poorer fits. Donner 
(1965) also showed that the simple dif- 
ference, P,(i) — P,(i), was not ade- 
quate to predict recognition data, 
The models outlined in this paper 
were constructed to account for speak- 
ers' and listeners' responses in referent 
communication; the Donner study and 
the test of the choice axiom with lis- 
tener data suggest that these models 
may also have promise for the develop- 
ment of new approaches to some of 
the more conventional problems in 
verbal learning and performance. Rote- 
learning techniques might, in turn, 
permit more definitive tests of the 
speaker model than have so far proven 
possible. That is, such techniques 
might permit more precise specification 
and control of the sets from which 
speakers' responses are sampled, and 
the sampling probabilities and associa- 
tive strengths of the responses in these 
sets. 
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INHIBITION OF THE UNCONDITIONED RESPONSE 
IN CLASSICAL CONDITIONING? 


H. D. KIMMEL 2 
University of Florida 


The classical conditioning process is characterized as a “dynamically” 
gradual transition from an initial excitatory state, in which the €S 
produces a short-latency CR, to an increasingly inhibitory condition, in 
which both the CR and UCR are attenuated and the former is greatly 
delayed, depending upon the CS-UCS interval used during acquisition. 
The highest point in the CR magnitude function is assumed to occur in 
fewer trials when long interstimulus intervals are used and in more 
trials when shorter intervals are used. UCR diminution is shown to 
depend upon an associative process by the device of omitting the CS 
and noting the increase in UCR strength thus produced. This 
increment has been found to be a function of the number of prior paired 
conditioning trials, the CS-UCS interval (in conventional fashion, with 
a peak at about .5 sec.), and whether a delayed or trace conditioning 


paradigm is used. It is suggested th: 
delay, upon the basis of which these 


at the development of inhibition of 


effects are assumed to occur, may 


be similar to differential or selective reinforcement (of an instrumental 
type) of late as compared with early responses. When the CR is made 


instrumental in shock avoidance, 
are not observed. A study of the 


and judged intensity of electric sh 
notion that this reinforcement may b 
the UCS when late responses occur, bi 


results, 


The purpose of this paper is to bring 
together under a relatively parsimoni- 
ous theoretical mantle a variety of em- 
pirical observations, most of which 
have only recently become part of our 
classical conditioning literature. The 
basic proposition elaborated below is 
that classical conditioning, rather than 
a “dumb” or senseless kind of learning 
(Tolman, 1938) or a mere artifact of 
instrumental learning (Smith, 1954), 
is, in fact, a quite flexible and emi- 
nently adaptive process. Since the 
basic argument involves, in some ways, 
a "marriage" of classical and instru- 
mental conditioning concepts, it should 
come as no surprise that "something 

old and something new" are present at 
the ceremony. The “something old” is 
1 Done under Grant M H-06060-02 from the 


National Institute of Mental Health. 
? Now at Ohio University. 
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CR attenuation and UCR recovery 
relationship between GSR amplitude 
ock failed to provide support for the 


€ due to reduced noxiousness of 
ut the issue is not closed by these 


Pavlov's (1927) concept of inhibition 
of delay. The "something new" con- 
cerns the effect of such inhibition on 
the magnitude of the unconditioned 
response (UCR). 

In classical salivary conditioning the 
adaptive function of inhibition of delay 
seems quite apparent. It is surely 
more economical to the dog to salivate 
late in the interstimulus interval, just 
before the food or acid unconditioned 
stimulus (UCS) is to be delivered, 
than to do so at the very onset of the 
conditioned stimulus (CS), long before 
the secretion can be of use in the masti- 
cation and digestion of the food or in 
the dilution of the acid, Tt is less ap- 
parent that delay of, say, a conditioned 
galvanic skin response (GSR) or eye 
blink could be of any adaptive value, 
yet this is what we will suggest. For 
expository purposes we will deal first 
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INHIBITION oF UCR iw 


with studies in which the CS has been 
shown to have an inhibitory influence 
upon the conditioned response (CR) 
and then with studies in which the 
UCR is similarly inhibited by the CS. 
As will be brought out below, these 
effects are better thought of as mani- 
festations of a unitary process influenc- 
ing both responses, the present distinc- 
tion being one of convenience only. 


Inhibition of the CR by the CS 


The phenomenon of inhibition of 
delay, itself, exemplifies the manner in 
which the CS gradually becomes cap- 
able of attenuating and delaying the 
CR it had previously been conditioned 
to elicit. In salivary conditioning, 
Pavlov (1927), of course, has amply 
demonstrated the degree to which such 
inhibition accrues under conditions of 
increasingly long delays of the UCS, 
and, in addition, he has shown how an 
exteroceptive stimulus presented dur- 
ing the latent interval acts to disinhibit 
this inhibition and bring forth an im- 
mediate and sometimes increased saliv- 
ary CR. Wenger (1936) and Rodnick 
(1937a) have reported more or less 
similar results with the human GSR 
and eyelid reflex. Rodnick’s (1937a) 
study showed, further, that greater in- 
hibition of delay was present using a 
“delayed” paradigm than with a “trace” 
paradigm, tying the process even more 
closely to the presence of the CS at the 
moment the UCS occurs rather than 
to its mere onset. 

The degree to which an added stimu- 
lus produces an increment in GSR 
amplitude has been shown to depend 
upon the number of CS-UCS pairings 
prior to the introduction of the extra 
stimulus (Kimmel & Fowler, 1961: 
Kimmel & Greene, 1964). Figure 1 
summarizes a portion of the results of 
the former study. In Figure 1 it can 
be seen that the average GSR to the 
combination of the visual CS and an 
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added auditory stimulus significantly 
exceeded the average GSR to the visual 
CS alone only following 12 CS-UCS 
pairings, but not following either 4 or 8 
pairings. The response to the trained 
CS alone rose for eight trials and then 
fell. This effect is even more striking 
when a greater number of paired trials 
are used before introducing the novel 
stimulus (Kimmel & Greene, 1964). 
In that study it was shown that the re- 
sponse to the novel stimulus continued 
to rise in negatively accelerated fashion 
throughout 50 paired trials, while the 
response to the CS alone first rose 
and then dropped. 

The Kimmel and Greene study also 
showed that the time during the CS- 
UCS interval at which the extra stimu- 
lus is delivered is a variable which 
greatly influences the ultimate response 
pattern observed at different stages of 
conditioning. In that study a 7.5-sec- 
ond interstimulus interval was used, 
and the extra stimulus was delivered at 
five different points in the interval— 
at its onset, after one-fourth, one-half, 
or three-fourths of its duration, or at its 
termination. Kimmel and Greene 
found that the time of onset of the 
extra stimulus during the CS-UCS in- 
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terval mattered little prior to condition- 
ing, that after a couple of conditioning 
trials the response to the novel stimulus 
was greatest at the beginning of the 
CS-UCS interval, and that late in con- 
ditioning the response to the novel 
stimulus was greatest just before the 
delivery of the UCS. They inter- 
preted their results to mean that, dur- 
ing conditioning, inhibition of delay 
produces a gradual shift in the focus of 
excitation within the CS-UCS interval, 
from its beginning to its end. The 
ultimate consequence of inhibition of 
delay, as Pavlov himself noted, may be 
the total inhibition of the CR during 
the interstimulus interval. 

It is of some interest to note that 
this form of inhibition, that is, initiated 
and controlled by the CS during the 
interstimulus interval, may result in 
organism-wide tranquility. Rodnick 
(1937b), for example, found that the 
eyelid reflex, conditioned to a different 
CS previously, was depressed when 
measured during the long delay interval 
of the CR currently being conditioned 
to a new CS. Spence and Runquist 
(1958) showed that the unconditioned 
eye blink to a weak air puff was 
depressed when elicited after a CS pre- 
viously paired with electric shock had 
been on for $ second. Tranquility dur- 
ing conditioning has also been found to 
lead to drowsiness and even sleep by 
Pavlov (1927). 

Pavlov recognized that every CS, 
even in the presence of reinforcement, 
regularly acquires inhibitory as well as 

excitatory properties. Indeed, he 
wrote: “The function performed by the 
unconditioned reflex after the condi- 
tioned reflex has become established is 
merely to retard the development of 
inhibition [1927, p. 234]." That this 
is true in GSR conditioning is rela- 
tively well known (Kimmel, 1964). 
Several studies have shown that the 
course of the curve of conditioned GSR 


amplitude during reinforcement is first 
upward, for varying numbers of trials, 
and then downward, until eventually it 
may fall below even the level observed 
before conditioning (Grings, Lockhart, 
& Dameron, 1963; Kimmel, 1959; 
Meryman, 1953). The number of 
trials required for the highest point in 
the curve to be reached appears to be 
inversely related to the CS-UCS inter- 
val employed. Meryman (1953) used 
an interval of 19 seconds and found a 
peak in the CR amplitude curve at 
about two or three trials. Kimmel 
(1959) used a 4-second interval and 
Observed a peak at six to eight trials. 
Grings et al. (1963) ran two groups 
with intervals of .5 second and 5.0 
seconds and found a peak in the 5.0- 
seconds condition at about five trials, 
while the CR amplitude curve was still 
rising in the .5-second condition after 
eight trials. Support for this generali- 
zation may be found even in eyelid- 
conditioning data. Ost (1960) found 
that the highest point in the percent- 
age of CR curve was reached after only 
30 conditioning trials with 1.0-second 
interval, but that the curve was still 
rising after 50 reinforcements with a 
-5-second interval, 

These studies Support the notion that 
inhibition of the CR associated with 
the CS develops more readily and in 
fewer trials when long rather than 
short interstimulus intervals are used, 
Logical extension of this relationship 
leads to the rather obvious prediction 
than an infinite interstimulus interval 
would result in a peak in the CR ampli- 
tude curve at zero trials (conditioning 
would not occur at all). At the other 
extreme, a zero interval would require 
an infinite number of trials for the 
peak of the curve to be reached, which 
is in accord with Pavlov’s assertion 
that the UCS (here delivered simul- 
taneously with the CS) acts to over- 
come the inhibition normally being 
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attached to the CS. Thus it is not sur- 
prising to note that simultaneous con- 
ditioning fares poorly (McAllister, 
1953; White & Schlosberg, 1952). 


Inhibition of the UCR by the CS 


The extent to which the amplitude of 
the UCR may be attenuated by the CS 
during classical conditioning is best 
demonstrated by the technique of omit- 
ting the CS after several pairings of it 
with the UCS and presenting the UCS 
alone. Kimble (see Hilgard & Mar- 
quis, 1961) has described an eye- 
lid-conditioning experiment that rep- 
resents the first published report of 
this procedure. In a study by Kimble 
and Dufort (1956) on the effect of 
interpolated UCS presentations on eye- 
lid conditioning, a series of 20 CS- 
UCS trials was followed by 20 presen- 
tations of the UCS alone and, then, 20 
additional CS-UCS trials, An inci- 
dental observation in this study was 
that the size of the UCR, which had 
diminished gradually during the first 
20 CS-UCS trials (an effect previously 
noted by Hilgard & Marquis, 1961; at- 
tributed by them to reflex habituation), 
increased abruptly on Trial 21 when the 
UCS was not preceeded by the CS. 
The UCR remained at a high level un- 
til Trial 41 when the CS was again 
presented just before the UCS, and the 
UCR was again attenuated. This re- 
duction in size of the UCR in the pres- 
ence of the CS was not attributable to 
its having been elicited during a re- 
fractory period following the CR, since 
the reduction did not appear to be cor- 
related with the presence of the CRs. 
Indeed, the reverse was more likely, 
since individuals showing the fewest 
CRs were the ones who evidenced the 
most reduction in the UCR. 

Kimble and Ost (1961) reasoned 
that the reduction in UCR amplitude 
was due to some form of learned inhi- 
bition. To check on this, they studied 
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the relationship between UCR diminu- 
tion and recovery and the CS-UCS in- 
terval that had been used prior to omit- 
ting the CS. They found that the 
greatest degree of recovery of the UCR 
produced by omitting the CS occurred 
in a group of subjects conditioned with 
an interstimulus interval of .5 second. 
As a matter of fact, their curve of the 
relationship between the CS-UCS in- 
terval and degree of UCR recovery 
was almost indistinguishable from typi- 
cal curves of the so-called interstimulus- 
interval function in conventional condi- 
tioning studies (McAllister, 1953; 
Moeller, 1954; Prokasy, Fawcett, & 
Hall, 1962). Kimble and Ost con- 
cluded that a conditioned inhibitory 
process resembling Pavlov’s inhibition 
of delay must have been responsible for 
the reduction in UCR amplitude in the 
presence of the CS. 

UCR reduction in the presence of 
the CS has also been found in salivary 
conditioning (Kupalov, 1961). In his 
description of “pathological” irradi- 
ation of inhibition in one of his dogs, 
Kupalov noted that the UCR was 
completely eliminated during an ex- 
tended series of reinforced delayed 
conditioning trials. He used the term 
pathological because the dog eventually 
even failed to secrete saliva during 
normal feeding without any CS at all. 

Diminution of the UCR in GSR con- 
ditioning appears to be equally demon- 
strable. Lykken (1959) has reported 
that the amplitude of the GSR to an 
electric-shock UCS which was re- 
peated every 30 seconds was sharply 
reduced when a tone was presented 5 
seconds before the shock. His inter- 
pretation of this observation involved 
the cognitive-perceptual concept of 
preception, the subjects presumably 
being better able to estimate the time of 
delivery of the shock following a Şa 
second warning than following a 30- 
second warning. Viewed in this way 
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the UCR diminution and recovery 
phenomenon in GSR conditioning 
would seem to be a close relative of 
what Grings (1960) and Kimmel 
(1960) have referred to as a perceptual 
disparity response, except that the lat- 
ter phenomenon was observed in situ- 
ations in which the CS was changed 
rather than omitted or the UCS was 
changed and the CS kept intact. Evi- 
dence presented below will be used to 
clarify the cognitive-associative ques- 
tion and provide support for one posi- 
tion over the other. 

Pursuing the lead suggested by the 
Kimble and Ost (1961) study, Kimmel 
and Pennypacker (1962) gave 4, 8, or 
16 CS-UCS presentations to different 
groups of subjects prior to presenting 
the electric-shock UCS alone. They 
found a significant decrement in UCR 
amplitude (GSR) during reinforce- 
ment followed by a large increment 
when the CS was omitted. The 
amount of increase in UCR amplitude 
produced by omission of the CS was 
linearly related to the logarithm of the 
number of prior reinforcements; that 
is, a negatively accelerated growth func- 
tion was obtained. Following this, 
Kimmel and Pennypacker,? in an un- 
published experiment, found evidence 
that the degree of recovery of the 
UCR due to omission of the CS followed 
the conventional interstimulus-interval 
function, as had previously been found 
by Kimble and Ost (1961), although 
the statistical significance of this re- 
sult was only marginal (p <.10). 
The overall increase in UCR amplitude 
following omission of the CS was, how- 

ever, highly significant. . 

Since the results of these studies pro- 
vided further support for the proposi- 
tion that diminution of the UCR during 
classical conditioning is, itself, condi- 
tioned, Morrow (in press) studied the 

3Done while the author was a visiting 
assistant professor at Duke University, 1961. 
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possibility that it would be extinguish- 
able in the conventional sense. He 
gave three groups of subjects 3, 6, or 
12 CS-only extinction trials following 
paired CS-UCS trials and, after this 
extinction, presented a paired CS-UCS 
trial. All three groups made larger 
UCRs following extinction than they 
had before extinction (i.e., the condi- 
tioned UCR diminution had been ex- 
tinguished as hypothesized), a result 
not found in three control groups that 
had received the CS and UCS in an 
unpaired series of presentations prior to 
extinction and the CS-UCS test trial. 
No significant differences in amount of 
extinction of UCR diminution were 
found to be associated with the differ- 
ent numbers of extinction trials, a find- 
ing that led Morrow (in press) to 
speculate that extinction of this type of 
inhibition may be more rapid than is 
typical. 

The extent to which the CS controls 
diminution of the UCR in GSR con- 
ditioning was studied by Baxter 
(1966). Half of his conditioning sub- 
Jects received a 5-second tone paired 
with electric shock (delayed para- 
digm), while the other half received a 
l-second tone whose onset preceeded 
the shock by 5 seconds (trace para- 
digm). Thus the CS-UCS interval 
was 5 seconds in both groups. Two 
control groups were run in which the 


l- or 5-second tone and shock were 
delivered in an 


: unpaired sequence. 
Test trials on the UCS alone were 
presented following either conditioning 
or the control treatment, While di- 
minution of the UCR Was observed in 
both of the paired £roups, it occurred 
to a significantly greater extent in the 
delayed (5-second CS) conditioning 
group than in the trace (1-second CS) 
group. Similarly, omission of the CS 
led to greater recovery of the UCR 
in the delayed group than in the trace 
group. The control groups showed 
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only slight UCR reduction and no re- 
covery on test trials. These results 
support the conclusion that the actual 
physical presence of the CS leads to 
greater inhibition of the UCR than 
occurs in the presence of the “trace” of 
a CS. The slight reduction in UCR 
size that was observed in the control 
groups was attributed to habituation 
rather than an associative mechanism. 

Baxter’s (1966) results are rele- 
vant also to the question of whether 
UCR diminution is a simple condition- 
ing phenomenon or would be better 
understood in _ perceptual-cognitive 
terms. If the reduction in the size of 
the UCR results from the fact that the 
subject comes to “know” that the UCS 
follows the CS and, consequently, “pre- 
pares" to receive it when the CS is pre- 
sented, it would be reasonable to as- 
sume that this preparation involves a 
temporal parameter, namely, the time 
when the shock will be presented fol- 
lowing the CS. But in Baxter's ex- 
periment both conditioning groups had 
a S-second warning period from the 
onset of the CS to the onset of the UCS 
and both, under this reasoning, should 
have developed similar preparatory 
Sets. If any perceptual advantage ex- 
isted in the situation, it should have 
favored the trace conditioning group, 
since it was probably easier to estimate 
the 4 seconds from CS offset to UCS 
onset in the trace condition than it was 
to estimate the 5 seconds from CS onset 
to UCS onset in the delayed condition. 
It seems more likely that the diminu- 
tion phenomenon depends more upon 
the presence of CS initiated and con- 
trolled inhibition than upon the subject's 
preparation to receive the shock. 

A perceptual approach would “ex- 
plain" the increment in the size of the 
UCR produced by omission of the CS 
as a manifestation of the subject’s “sur- 
prise" at receiving the UCS alone. 
Again, however, Baxter's (1966) re- 
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sults do not support this position. On 
what basis would one expect the de- 
layed conditioning procedure to result 
in greater surprise at receiving an un- 
paired shock than occurred in the trace 
conditioning group? Presumably, the 
argument would be made that the trace 
paradigm is somewhat more similar to 
no pairing at all than is the delayed 
paradigm. This argument, however, 
since it places its emphasis upon the 
contiguity of CS offset and UCS onset, 
is a rather different perceptual theory 
than that usually proposed to handle 
the acquisition of conditioning, as ex- 
emplified in the preceeding paragraph. 
In addition to the groups already de- 
scribed above, Morrow (in press) ran 
an additional control group that re- 
ceived 12 unpaired shocks following 
conditioning, instead of the extinction 
trials of CS alone. As had previously 
been reported for the eyelid reflex by 
Kimble (see Hilgard & Marquis, 1961). 
reintroduction of CS-UCS pairing fol- 
lowing UCS-only trials. a change in 
procedure no less surprising than omis- 
sion of the CS, resulted in a drop in 
the size of the UCR. On the basis of 
the results obtained in these studies, it 
would appear that a simple condition- 
ing model provides a more adequate 
theoretical format than does the less 
parsimonious, albeit more "human," 
cognitive-perceptual interpretation. 


Instrumental Features of UCR Dimi- 
nution 

We have assumed that reduction in 
the size of the UCR in classical aver- 
sive conditioning is a manifestation of 
inhibition of delay and is thus under 
the control of the CS. When the CS 
is omitted, the inhibition is absent, and 
the UCR is larger. Two questions are 
suggested by this argument, First, is 
it possible that inhibition of delay di: 
velops as a result of differential rein. 
forcement of late as compared with 
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early CRs? Second, does the assumed 
differential reinforcement in question 
result from the fact that smaller UCRs 
reflect reduced noxiousness in the sub- 
ject’s experience? Empirical answers 
to both of these questions have re- 
cently been sought. 
What would we expect to happen to 
the UCR in a GSR conditioning ex- 
periment in which the CR is made in- 
strumental (i.e., leads to avoidance of 
the shock)? If inhibition of delay is 
the result of differential reinforcement 
of late as compared with early respond- 
ing in the CS-UCS interval, we would 
have to predict that UCR recovery 
when the CS is omitted would not oc- 
cur appreciably in an avoidance situ- 
ation, since it would be more adaptive 
to make an early CR and avoid the 
shock than it would to develop inhibi- 
tion of delay and receive a possibly less 
noxious shock. 

Kimmel (1965) has recently de- 
scribed a GSR conditioning experiment 
in which the subjects were run in 
yoked pairs such that whenever the 
avoidance subject of a pair made a 
CR of criterion magnitude both sub- 
jects received no shock on that trial. 
Half of the subjects were run with a 
5-second CS and half with a 1-second 
CS, but all subjects had a 5-second CS- 
UCS interval. Following the acquisi- 
tion trials, a test trial on the UCS alone 
was given to all subjects. The avoid- 
ance groups showed a slight decrement 
in the size of the UCR when the UCS 
was delivered alone following condi- 
tioning, while both the delayed and the 
trace yoked classical groups showed 

the expected increment, although the 
difference between avoidance and 
yoked control groups was statistically 
significant only in the delayed para- 
digm. These results are in accord 
with the assumption that the CS ac- 
quires its inhibitory properties only 
in a classical conditioning situation, but 


not when the CR is made instrumental. 
When an early CR results in shock 
avoidance, omission of the CS at the 
end of the training does not result in an 
increment in the size of the UCR. Also 
pertinent to this argument is the recent 
observation (Kimmel & Baxter, 1964) 
that inhibition of the CR also fails to 
occur when an avoidance paradigm is 
used in GSR conditioning. 

Does the fact that a smaller GSR 
is made to the shock when it is paired 
with the CS than when it is presented 
alone mean that the shock is perceived 
as less intense? Kimmel and Schultz 
(1964) had their subjects judge the 
intensity of electric shocks ranging 
from .5-3.5 milliamperes at the same 
time that GSRs were measured. Both 
judged intensity and GSR magnitude 
were significantly positively related to 
shock intensity, but the correlation be- 
tween GSR magnitude and judgments 
was found to be significantly positive 
in only one of the nine subjects that 
were observed, when the effect of 
shock intensity was partialed out sta- 
tistically. Since a significant negative 
correlation was found in the data of 
one other subject, the results of this 
study surely do not provide support 
for the assumption that the perceived 
intensity of the shock holds the ex- 
planatory key to a differential rein- 
forcement interpretation of UCR dimi- 
nution and recovery, 


Summary and Conclusions 


The studies reviewed in this paper 
support the conception that the acqui- 
sition of classical conditioning involves 
a distinctly negative component side by 
side with its more well-known positive 
characteristics. After the formation of 
an initial association of an excitatory 
nature, the CS gradually begins to de- 
velop inhibitory properties that result 
in attenuation and delay of the CR and 
even its ultimate total loss under con- 
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tinued reinforcement, as well as dimi- 
nution of the UCR in the presence of 
the CS. The latter is revealed particu- 
larly by the technique of omitting the 
CS and comparing the resulting UCR 
with UCRs observed (both before and 
after) when the CS is present. UCR 
recovery has been shown to vary as a 
function of the CS-UCS interval used 
during training (Kimble & Ost, 1961), 
the number of reinforced trials prior to 
omitting the CS (Kimmel & Penny- 
packer, 1962), and whether a delayed 
or trace conditioning paradigm was 
used prior to CS omission (Baxter, 
1966). . The proposition that UCR 
diminution and recovery depend upon 
a conditioning or associative process is 
further Supported by the observation 
that it does not occur appreciably in an 
unpaired sequence of CSs and UCSs 
(Baxter, 1966; Morrow, in press), 
and by the finding that standard ex- 
tinction procedures result in its loss 
(Morrow, in press). Several features 
of these results do not accord with the 
notion that they are due to a cognitive- 
perceptual type of process, although 
further research on this issue is surely 
not precluded. 

The possibility that CS-controlled 
inhibition is related to an instrumental 
differential-reinforcement process re- 
ceives some support from the fact that 
CR inhibition does not occur in an 
avoidance conditioning situation and 
that UCR recovery when the CS is 
omitted is similarly absent in avoidance 
GSR conditioning while present in a 
yoked classical control condition. That 
the mechanism of such reinforcement 
is to be found in the perceived noxious- 
ness of the UCS, while not supported 
in one study of the relationship be- 
tween GSR magnitude and judged in- 
tensity of electric shock, must still be 
considered a possibility and should be 
the object of future research in this 


area, 


239 


If they do nothing else, the results 
of this line of investigation can be 
taken to mean that a new view of clas- 
sical conditioning deserves serious con- 
sideration. Rather than a purposeless 
ritual in which a "passive" subject has 
things happen to him but can contribute 
little of an instrumental nature himself, 
classical conditioning may be viewed 
fruitfully as an "active" or "dynamic" 
interaction between the organism and 
its environment. While the pejorative 
use of such terms as passive and "auto- 
matic" has little scientific significance, 
the possibility that the functionalistic 
approach described in this paper may 
lead us closer to the truth cannot be 
foreclosed merely because there is noth- 
ing “wrong” with the traditional view. 
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REPLY TO PAUL THOMAS YOUNG 


FRANCIS W. IRWIN 


University of Pennsylvania 


Young (1966) claimed that there is evidence against Irwin’s (1958) 
assertion that if an organism exhibits a discrimination it also exhibits 
a preference. The claim seems to be due to his not taking into account 
that, in any experimental determination, the objects of discrimina- 
tion are not identical with the objects of preference. 


In a recent paper Young (1966, p. 74) 
writes: 


If an animal shows a preference between 
two fluids, we can be sure that he is capable 
of discriminating between them, but the 
oe of this proposition is not necessarily 
rue. 


To this he appends the following foot- 
note: 


Irwin. (1958) wrote: “By our definition, an 
organism cannot exhibit a preference without 
exhibiting a discrimination, and conversely 
[p. 162].” The first part of this statement 
is certainly true, but the converse is false. 
There is plenty of evidence that animals 
and human beings discriminate without re- 
vealing a preference. 


This flat denial of a main point of my 
paper (Irwin, 1958) by a well-known 
student of preferences requires an answer. 
The point itself is not trivial, even if its 
implications for method are put aside. 
Discrimination has long been recognized 
as a fundamental psychological concept; 
my conclusion, based in part upon the 
assertion that Young denies, was that the 
motivational concept of preference is 
equally fundamental. 

If my statement meant what Young 
seems to have taken it to mean, I would 
indeed be convicted, not of going against 
the facts—the question is one of choice 
of defining criteria, not one of fact—but 
of adopting criteria that were, to say the 
least, esoteric. I agree with Young's 
implication that an animal may discrimi- 
nate between two fluids without prefer- 
ring one to the other, but his quotation 
from my paper does not contradict this. 
What the whole argument of my paper 


leads to is the implication that if an ani- 
mal exhibits a discrimination between two 
fluids, then, because of the relations be- 
tween the criteria offered for discrimina- 
tion and preference, it also exhibits a pref- 
erence for some outcome over some other 
outcome. Indeed, the next two sentences 
of my article read: 

It is important to recognize, however, that 
in a given experimental determination, the 
objects of preference are never identical with 
the objects of discrimination. The former 
are features of the outcomes of acts, while 
the latter are features of the situations that 
exist before the organism acts [p. 162]. 


I regret Young’s misunderstanding of 
my paper all the more because I believe 
that he and I are allied closely in our 
views of motivational problems. Were 
he to perform an experiment to deter- 
mine whether animals would discriminate 
between some pair of discriminanda, I 
am confident that he would provide dif- 
ferential outcomes ("incentives")—per- 
haps a highly palatable sucrose solution 
and an empty vial—that he was already 
sure would be highly different in prefer- 
ence value for the animals. If they 
displayed the discrimination, they would 
necessarily also show that his judgment 
of the preference was correct. 
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ON NOISE IN THE NERVOUS SYSTEM! 


LAWRENCE R. PINNEO 


Delta Regional Primate Rescarch Center, Tulane University 


Treisman and Hebb have suggested that "spontaneous," "random," or 
"background" activity in the nervous system constitutes "noise" in 
discrimination and learning; that is, this type of activity has no func- 
tional value to the organism. This paper attempts to show that tonic 


activity, a term including all of the types of activity listed above, is 
rather the functional substrate of the brain. Examples are cited for 
the skeletal and autonomic motor systems, the primary sensory systems, 
and the diffuse ascending and descending reticular activating systems 
to show that the tonic activity in the entire brain enters into all 
discrimination and learning, and, in agreement with Lashley, represents 


the neural basis of behavior. 


Two recent theoretical papers have ar- 
gued that "noise" in the nervous system 
has interfering effects in discrimination 
and learning. In one, Treisman (1964) 
has suggested that three sources of noise 
limit discrimination: (a) the irreducible 
physical variability of the stimulus, (b) 
the "spontaneous" neural background ac- 
tivity to which a stimulus is added, and 
(c) the neural noise arising from varia- 
tion in the pathways transmitting mes- 
sages centrally. Based on these three 
sources of noise, Treisman has worked 
out a complicated signal-detection theory 
to explain the form of the Weber function 
for visual intensity discrimination and 
for other sense dimensions. 

In the other paper, Hebb (1961) raised 
the question of the interfering effects of 
random activity in the nervous system 
during learning of a specific task. By 
learning he meant the modification of 
the direction of transmission in the cen- 
tral nervous system (CNS) at the syn- 
apse. He pointed out that a large brain 
such as a mammal’s has many more 
neurons present than are necessary for 
learning a specific task. Therefore, ran- 
dom activity in the neurons not involved 
in learning the task constitutes noise, 
which Hebb felt must interfere with 
learning. 


YThis work was supported by National 
Institutes of Health, United States Public 
Health Service Grants FR-00164-03 and 
N B-04951-01. 


These two papers illustrate a widely 
held misconception of brain function, 
namely, that the Spontaneous, random, or 
background discharge of neurons has lit- 
tle or no functional value; that is, this 
activity has no information value for the 
organism and therefore is noise in the 
communications sense of the word. In 
this theoretical note I attempt to sh 
that the neural noise to which Treis 
and Hebb refer (leaving out Treisman's 
first category) is not noise at all, that this 
neural activity does not limit discrimina- 
tion nor interfere with learning, and that 
in fact this activity is essential to dis- 
crimination and learning, 

Arduini (1963) has suggested that 
there are fundamentally two types of 
nervous discharge, and he has carefully 
defined them; his definitions will be used 
in this paper. Borrowing from the ter- 
minology applied to muscle activity, Ar- 
duini defines "phasic" activity as a tran- 
sient increase or decrease in impulse 
firing rates of neurons that is time locked 
to a particular stimulus. A familiar ex- 
ample is the evoked response, “Tonic” 
activity on the other hand is nontransient, 
or continuous, neural discharge in which 
the average firing rate is random and 
constant and is not time locked to a 
stimulus. So.called Spontaneous activity 
(a dubious term at best since it implies 
the discharge of neurons without benefit 
of influences external to the neurons), 


ow 
man 
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background activity, random discharge, 
and the like, are familiar examples of 
tonic activity. 

Tonic activity may conveniently be 
considered as the functional substrate of 
the nervous system. As Lashley (1951) 
has pointed out, neurological theory has 
been dominated by the incorrect belief 
that neurons of the CNS are inactive for 
the greater part of the time. According 
to this view, neurons are always in the 
resting state where they are linked in the 
relative isolation of reflex arcs until a 
particular reaction for which they are 
specifically associated activates them. Yet, 
no matter where in the CNS one places 
an electrode, and no matter what the con- 
dition of the organism as long as he is 
alive, a steady discharge of neurons may 
always be recorded. In general, level of 

arousal” or “activation” increases with 
the level of tonic activity throughout the 
brain. And it is against this tonic level 
that sensory messages are received and 
motor acts achieved. As Lashley (1951, 
p. 131) has so well expressed it, “all the 
cells of the cerebrospinal axis are being 
continually bombarded by nerve impulses 
from various sources and are firing regu- 
larly, probably even during sleep.” And, 
in the intact organism, behavior is the 
result of interaction of this background 
of excitation with input from any desig- 
nated stimulus [p. 112].” 
a The nonspecific attributes of tonic ac- 
tivity are now well documented following 
the experiments of Moruzzi and Magoun 
(1949) on the cortical arousing effects 
pe stimulation of the brain-stem reticular 
no and the descending tonic in- 
icem CS lower brain stem upon 
SEL saa a shown by Lindsley, 
] 
tonic motor CP ees se md ded 
Bac itself increases level 
ation and modifies behavior (Pin- 
neo, 1961). But tonic activity also has 
been shown to have specific functional 
value in the skeletal and autonomic motor 
systems for the regulation of movement 
and the function of visceral organs. Thus. 
Sherrington (1906) observed that at the 
level of motor units the intensity of a 
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mechanical effect resulting from a normal 
tonic discharge depends in large part 
upon the frequency of tonic impulses. 
Tower (1949) has carried this concept a 
little further by suggesting that the tonic 
nature of the discharging motor neurons 
is the expression of a central excitatory 
state (cf. Sherrington, 1906), modulated 
by a phasic or episodic function which 
appears as a specific contribution to 
individual acts. 

Stimulation of central tracts and nuclei 
concerned with regulation of the skeletal 
and autonomic motor systems illustrates 
the fine control that tonic activity in these 
systems exerts (see the many chapters 
devoted to these systems in Field, Ma- 
goun, & Hall, 1960). Almost countless 
locations have been found (for example, 
in the pyramidal and extrapyramidal sys- 
tems, cerebellum, and brain stem) in 
which stimulation with gross electrodes 
over short and long periods of time ap- 
pears to initiate and maintain highly inte- 
grated and complex patterns of muscle 
activity. Frequently, it is found that the 
extent and complexity of movement is 
directly related to current strength. What 
moves, however, depends upon the loca- 
tion of the electrode. Since at these 
levels it is highly unlikely that fibers are 
excited selectively and in the proper 
temporal sequence to produce such com- 
plex and organized movements, it must 
be concluded that it is the increase in 
tonic level brought about by stimulation 
of the descending tracts that controls the 
motor activity. The argument (Patton & 
Amassian, 1960) that many of these 
movements are reflex results of afferent 
stimuli by spread of current from efferent 
tract stimulation to nearby afferent tracts 
has some validity. However, stimulation 
of efferent pathways alone can produce 
the same highly organized movements 
(Dow & Moruzzi, 1958; Pompeiano, 
1959). : 

In our laboratory we have repeated anc 
extended some of these experiments and 


have been able to completely reproduce a 


series of highly integrated movements in 


the squirrel monkey (Saimiri sciureus) 
by simultaneous and successive stimula- 
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ion of several critical nuclei in the cere- 
bellies and brain stem (Pinneo, in press 
aa ee ee eee 
air, we explored v: 1 
er p s nuclei as the interpositus, 
Deiter's, fastigial, and vestibular, and sev- 
eral other nuclei and tracts of the caudal 
brain stem, to determine the type of 
movement that could be elicited at each 
locus and the optimum electrical parame- 
ters for obtaining the movement. Then, 
by chronically implanting up to six pairs 
of electrodes in chosen sites, and by the 
proper ordering of stimulation at each 
electrode pair with respect to the other 
electrodes, fine coordinated movements 
were reproduced at our desire. 

These effects were obtained whether 
the motor cortex concerned with the mus- 
cular groups in question was intact or 
not. For example, destruction of the 
motor cortex for the right arm and hand 
of the squirrel monkey produces paralysis 
in that limb, as in man. But, by proper 
placement and stimulation of electrodes in 
the homolateral interpositus nucleus, we 
were able to close the fingers (as in a 
grasp around an object) and to bring the 
hand rapidly and smoothly to the mouth; 
this could be repeated as often as desired. 
Other possible combinations included ex- 
tension of the arm outward (as in reach- 
ing), backward, and to the side. Other 
electrode locations were found that pro- 
duced similar movements in other limbs, 
opening and closing of the mouth (as in 
mastication), movement out and in of the 
tongue (as in licking), and control of 
rate and amount of respiration, and rate 
and volume of cardiac output. In all of 
the locations examined, the mechanical 
effect was proportional to current strength 
in the stimulating electrode (and there- 
fore proportional to the level of tonic 
activity in the tracts or nuclei involved), 
so that very finely graded movements 
were obtainable singly or in combination, 

The functional value of tonic activity 
is not limited to the skeletal and auto- 

nomic motor systems. In the primary 
visual pathway of mammals, tonic ac- 
tivity alone is recorded under maintained 
states of darkness or light (Arduini & 
Pinneo, 1961, 1962, 1963b; Granit 1955). 
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In the dark-adapted state a large number 
of retinal ganglion cells, perhaps most, 
maintain a relatively high rate of tonic 
discharge (Arduini & Cavaggioni, 1961; 
Granit, 1955). With the onset and offset 
of a prolonged light stimulus, “on” and 
“off” retinal bursts provide a momentary 
phasic response throughout the visual 
system whose size is proportional to the 
stimulus intensity. Shortly thereafter, 
however, and for as long as the intensity 
of the stimulus remains constant, the 
activity in the primary visual system is 
tonic (Arduini & Pinneo, 1961, 1962, 
1963b). Furthermore, the level of the 
tonic activity with a steady light was 
found by Arduini and Pinneo (1962) to 
be inversely related to the intensity of 
the light by a power function of a form 
quite similar to the brightness function 
described by Stevens (1962) for humans. 

Recently, Brooks (1964) has confirmed 
that the level of tonic activity in the 
primary visual System is the probable 
neural basis of brightness vision. 


s Using 
the classical AI/T versus I paradigm, 
Brooks trained Squirrel monkeys to make 
brightness 


discriminations over 4 log 
t and then re- 


geniculate nucleus under threshold co; 
ditions. Brooks found that (a) the 
Weber function of the squirrel monkey 
was very similar to that of man; (b) 
tonic and 
illumination of the retina as Arduini and 


Pinneo (1962, 1963b) had shown for the 
cat; and 


Treisman (1964) 
dark adaptation 
is greatest and 
of high illumination when Sensitivity is 
Poorest. It is difficult, therefore, to see 
how this noise interferes with discrimi- 
nation, 

Perception of flicker fusion and the 
brightness of an intermittent light at 
critical fusion frequency (CFF—the Tal- 
bot brightness) also have been shown to 
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depend upon the presence and level of 
tonic activity in the visual system (Ar- 
duini & Pinneo, 1963a). Recording from 
electrodes implanted in human patients 
for therapeutic reasons, Pinneo (in press 
—b) and Heath found that CFF, as 
determined by verbal report, occurred 
at that frequency of intermittent light 
where all or most phasic activity in the 
visual System had disappeared and only 
tonic activity remained. Furthermore, 
they found that the level of tonic ac- 
tivity at CFF was the same as that 
produced by a steady light of the same 
intensity as the average intensity of the 
flickering light. Thus, the flicker was 
Perceived as steady because the activity 
of the visual system responded as though 
the light were steady; that is, the dis- 
charge was tonic, and the level of tonic 
activity was inversely proportional to the 
power of the average intensity. 

Functional relations also have been 
found between the level of tonic activity 
in other primary sensory systems and 
environmental stimuli, notably the audi- 
tory pathway (Galambos & Davis, 1944; 
Galin, 1964) and the temperature or- 
(Hensel, 1952; Zotterman, 1953). 

“hough many other examples could be 
cited, these and the others described 
above should suffice to illustrate that tonic 
activity in the nervous system has very 
specific functional value and therefore is 
not merely noise. 
mie of the mode of action of phasic 
eet sage. in the CNS as demon- 
eei ha e experiments discussed sug- 
guides zem phasic activity selects and 
us RS ee while tonic activity con- 
Her oli neness and extent of the re- 
los 5 ce d sequence from stimu- 
bris CE Se, the distribution of the 

ot activity throughout the CNS 
may be described as follows: 

l. In sensory systems hasi ivi 
is the response to fratar. nE 
the level of tonic activity signals the 
average intensity of background stimuli 

2. Both phasic and tonic activity in 
sensory systems modulate the existing 
tonic discharge in the brain-stem reticular 
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formation. In this case the state of 
arousal is modified by the level of tonic 
activity in the ascending reticular acti- 
vating system; postural tonus is changed 
by the tonic discharge of the descending 
reticular activating system (motor set to 
respond?), while the phasic discharges 
probably mediate attention or orienting 
responses. 

3. In the skeletal and autonomic motor 
systems the phasic activity in specific 
motor nuclei and tracts selects and initi- 
ates movement and modifies visceral func- 
tion while tonic activity provides the fine 
control and extent of movements and level 
of visceral function. 

4. Feedback of phasic and tonic activity 
in all systems—receptor, sensory, motor, 
autonomic, and activating—to and from 
the cerebral cortex and subcortical nuclei 
provides the continued sequence of re- 
sponses necessary for completion of an 


act. 


It is evident from this analysis that 
stimuli are always superimposed upon the 
tonic activity of the entire brain, and 
that the perception and “meaning” of a 
stimulus must depend upon the relative 
amounts of tonic activity in the various 
parts of the nervous system. Discrimina- 
tion, generalization, and learning occur 
with repeated exposures under different 
and similar circumstances with different 
and similar patterns of tonic activity 
throughout the nervous system. This is 
well illustrated by the fact that percep- 
tion, discrimination, and learning depend 
upon and may be modified by changes 1n 
level of activation, patterns of muscle 
activity, and states of visceral activity 
and the activity in other sensory systems, 
all primarily tonic in nature. 

In his thoughtful evaluation of cere- 
bral organization and behavior, Lashley 
(1958) has given to tonic activity the 
major role of the neural basis of mind. 
Though reminiscent of Hebb's m 
concept of the “cell assembly and P. p^ 
sequence," in that Lashley organizes the 
brain into a large number of systems 
where each system consists of the traces 
of a number of habits or memories, 
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Lashley still ascribes mental function to 
the tonic activity of these trace systems 
rather than to a particular set of connec- 
tions between specific neurons and their 
patterns of activity. This concept is not 
only consistent with what is now known 
of the physiology of the brain, but is also 
the only explanation that will satisfy the 
efficiency of behavior following brain 
damage (e.g. Lashley's laws of “mass 
action" and “equipotentiality” ) and the 
many controversial neurological interpre- 


tations of events in perception and 
learning. 
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ON REMOTE ASSOCIATIONS AND THE INTERPRETATION 


Since Ebbinghaus’ 


OF DERIVED-LIST EXPERIMENTS? 


DAVID T. HAKES anp ROBERT K. YOUNG 


University of Texas 


Serial learning is generally assumed to involve the formation of direct 
and remote interitem associations. Such an interpretation appears to 
predict positive transfer in the derived-list paradigm, a prediction 
apparently supported by Ebbinghaus’ finding of savings in this 
paradigm. It is pointed out that: (a) The theory actually predicts 
negative transfer in this paradigm due to interference from associations 
which were correct in the 1st list and are incorrect in the 2nd list; 
and (b) because of the control condition used, Ebbinghaus’ findings 
indicate negative or 0 transfer. Recent findings of negative or 0 
transfer are consistent with the theory and with the Ebbinghaus data 
when both are correctly interpreted. 


(1885) first ex- 


periments, the dominant conception of 
serial learning has been that it involves 
the formation of interitem associations 
between the adjacent items of the list. 
Thus the learning of a serial list, A-B- 
C-...-N, is hypothesized to involve the 
formation of associations A-B, B-C, etc. 
In addition, serial learning theorists have 
generally assumed that remote as well as 
direct interitem associations are formed 
(cf. McGeoch & Irion, 1952, Ch. 4). 
Formation of remote associations is as- 
sumed to occur in both the forward (e.g. 
A-C, A-D) and backward (e.g, B-A, 
C-A) directions, and their strength is 
assumed to decrease as the distance be- 
tween the items increases, forward asso- 
ciations being stronger than backward 
associations of the same degree of re- 
moteness. . f 
Through these assumptions concerning 
remote associations, it is possible to de- 
rive an account of the bowed serial- 
position curve typically found during se- 
rial learning (Bugelski, 1950). The 
greater difficulty of items in the middle 
of the list is attributed to the greater 
number and strength of remote associa- 
tions for these items than for those at 
the ends of the list, and the asymmetry 
of the serial-position curve is attributed 


1 Preparation of this note was supported 
in part by National Science Foundation 
Grant GB-979. 
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to the greater strength of (and, hence, 
greater interference from) forward than 
backward associations. While some theo- 
rists have suggested modifications of such 
a theory (eg., Ribback & Underwood, 
1950), few have been led to question the 
assumption of remote associations. Ac- 
ceptance has been so widespread over 
Such a period of time that Slamecka 
(1964) has suggested that it might be 
referred to as the “doctrine” of remote 
associations. 

A major early source of support for 
the remote-association assumption was 
the results of Ebbinghaus’ (1885) studies 
of the derived-list paradigm. Ebbinghaus 
found savings in second, derived-list 
learning which were greatest for the first- 
order condition and decreased as the 
order of derivation (ie, degree of 
remoteness) increased. The smallest 
amount of savings was obtained when the 
derived list was a random rearrangement 
of the items. These results have gener- 
ally been interpreted as demonstrating 
the existence of remote forward associa- 
tions and as indicating that these decrease 
in strength as the degree of remoteness 
increases, 

In view of the methodological short- 
comings of Ebbinghaus’ experiments (cf. 
Hakes, James, & Young, 1964; Slamecka, 
1964), it is somewhat surprising that 
their results and interpretation have been 
so widely accepted without serious ques- 
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tion or adequate replication. It may well 


. be that their acceptance resulted from 


their apparent consistency with prevailing 
Serial learning theory. Three early stud- 
les of the derived-list paradigm (Cason, 
1926; Hall, 1928; Lepley, 1934) yielded 
results which were not very supportive of 
Ebbinghaus' data. But because of meth- 
odological problems, these studies have 
been accorded little weight in the litera- 
ture (Slamecka, 1964, p. 63). 

Irion (1946) did report that the ran- 
dom-order derived-list condition (the one 
used by Ebbinghaus as a control condi- 
tion) yielded negative transfer and retro- 
active inhibition. In commenting upon 
this finding, McGeoch and Irion (1952) 
note that: 


This interference cannot serve except to re- 
red remote association effects. The posi- 
n evidence for remote associations is not a 

nction of such interference, although 
remote associations, themselves, may be a 
^: of positive transfer of training [p. 


This would appear to represent an at- 
tempt to reconcile Ebbinghaus’ finding 
of savings with Irion’s finding of inter- 
Ies but something seems to be lack- 

g. 

Recently, a number of experiments have 
been performed on the derived-list para- 
digm using more adequate methodology 
than the early studies. Hakes et al. 
(1964), studying first, second-, and 
third-order derived-list conditions, ob- 
tained negative transfer which was great- 
< for the first-order condition and 
ee i pi the order of derivation in- 
(1965 . oung, Hakes, and Hicks 
(1965), studying 8-, 12-, and 16-item 
lists in the first-order derived-list condi- 
tion, found strong evidence of negative 
transfer for the 12- and 16-item conditions 
and nonsignificant positive transfer for 
the 8-item condition. 

Slamecka (1964) has performed a 
series of derived-list experiments involv- 
ing the first-order condition and a condi- 
tion involving, within a single list, zero- 
first-, and second-order conditions. Gom 
paring these conditions with a random- 
order condition, Slamecka reported no 


evidence of faster learning of the derived 
lists except in one case, involving the 
first-order condition, in which the sub- 
jects were explicitly told the relationship 
between first and second lists. 

It is not the question of the empirical 
results of derived-list studies which is of 
interest here. This question would appear 
to be answered adequately by the experi- 
ments just discussed. The question is, 
rather, how these results are to be inter- 
preted. With reference to this, two ques- 
tions arise, The first is concerned with 
the relation between Ebbinghaus' results 
and those of more recent experiments. 
The immediately apparent conclusion is 
that the finding of negative transfer is 
incompatible with Ebbinghaus' results. 
As Hakes et al. (1964) state: "It would 
appear, then, that the present experiment 
provides no support for Ebbinghaus' data 
or for the theory of remote associations 
based on these data [p. 513]." On closer 
inspection, however, such a conclusion 
appears not to be justified. 

It has generally been assumed that Eb- 
binghaus' results, reported in terms of 
savings, showed positive transfer (cf. 
McGeoch & Irion, 1952, p. 94). Since 
Irion (1946) has shown that the random- 
order condition (Ebbinghaus’ control 
condition) yields negative transfer, Eb- 
binghaus' findings cannot be considered 
to show positive transfer: The savings 
in the derived-list conditions were greater 
than for a negative-transfer condition and 
might well represent decreased negative 
transfer rather than positive transfer. 
Thus it would appear that the inconsist- 
ency between the savings found by Eb- 
binghaus and the negative transfer found 
by Hakes et al. (1964) and by Young et 
al. (1965) is more apparent than real. 

Slamecka's results may be interpreted 
in the same manner. His general finding 
is that the derived-list condition is not 
superior to the random-order condition 
and is, thus, a negative-transfer condition. 
It is of interest to note that when sub- 
jects are aware of the interlist relation- 
ship in the first-order derived-list para- 
digm the amount of negative transfer is 
reduced. This condition would appear 
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to be directly comparable to the Ebbing- 
haus experiments, as Ebbinghaus used 
himself as the only subject and must have 
been aware of the relationship between 
lists. The preponderance of evidence 
suggests, then, that the derived-list para- 
digm yields negative transfer. Instruction 
can reduce the amount of transfer, but 
whether under these conditions positive 
transfer occurs is indeterminate since 
neither Ebbinghaus nor Slamecka used 
appropriate control conditions, 

'The second, and major, issue of inter- 
pretation concerns the relationship of 
these results to an interitem theory of 
serial learning. Initially, it would seem 
that the results are inconsistent with such 
a theory, at least with one which assumes 
remote associations. Again, closer in- 
spection reveals that this is not the case. 

It has been assumed that the formation 
of remote associations during first-list 
learning should facilitate the learning of 
a second list in which the remote asso- 
ciations become the direct interitem as- 
sociations, That is, during second-list 
learning, the previously incorrect, remote 
associations become direct, correct asso- 
ciations, and the strength which they have 
accrued during first-list learning is the 
basis for positive transfer. Such an 
interpretation of the remote-association 
hypothesis is, of course, consistent with 
the erroneous interpretation of Ebbing- 
haus’ results as indicating positive trans- 
fer. 

Such an interpretation of the hypothesis 
is, however, incomplete. In the learning 
of a derived second list, the previously 
correct, direct associations are no longer 
correct and are instead remote associa- 
tions. These associations must, at the 
end of first-list learning, have been 
stronger than any remote associations in 
order for the first list to have been 
learned. During  second-list learning, 
these strong, and now incorrect, associa- 
tions should, according to an interitem- 
association theory, be a source of power- 
ful interference and negative transfer, 
Thus it would appear that a theory of 

direct and remote interitem associations 
considers the derived-list paradigm to be 
a negative-transfer paradigm. Positive 
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transfer can be predicted only if inter- 
ference from correct first-list associations 
is ignored. It should be noted that when 
an analogous paired-associate learning 
situation is encountered (Young & Jen- 
nings, 1964), in which associations in- 
correct in the first list become correct 
in the second, negative transfer is ob- 
tained. 

It would appear, then, that contrary to 
the customary interpretation, the remote- 
association hypothesis is consistent with 
the obtained negative transfer in the 
derived-list paradigm. The confusion 
which has arisen has two sources: the 
misinterpretation of the Ebbinghaus re- 
sults and an improper derivation from the 
hypothesis. When both errors are made, 
Ebbinghaus' results appear to be con- 
sistent with the hypothesis. The in- 
consistency becomes apparent only upon 
consideration of the recent experiments 
which measured transfer rather than 
savings. 

The one inconsistency which remains 
is that the remote-association hypothesis 
appears to predict that the amount of 
negative transfer should increase as the 
degree of remoteness increases. This, of 
course, is the result which Ebbinghaus 
obtained. The one recent study which 
yields data relevant to this prediction 
(Hakes et al., 1964) does not support 
the prediction. No explanation for this 
is readily available, 

Finally, it should be noted that the 
empirical results which have been dis- 
cussed are also consistent with other in- 
terpretations of serial learning, 
an ordinal 
1962). 


i such as 
position theory (cf, Young, 
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A COMMENT ON INTERMEDIATE SIZE DISCRIMINATION 
AND ADAPTATION-LEVEL THEORY: 


DONALD A. RILEY, MARIAN SHERMAN: Ax» JOHN P. McKEE 
University of California, Berkeley 


Zeiler's (1963a, 1963b) adaptation-level (AL) interpretation of trans- 
position in children following training on an intermediate size problem 
is compared with Stevenson and Bitterman's (1955) conclusion that S 
learns to respond to both relational and absolute properties of the 


stimulus. 


Based both on the analysis of Zeiler's data and new data it 


is concluded that while it is not necessary to assume a changing to AL 
to account for the facts of intermediate size learning and transfer, it is 
necessary to assume that S's response in training and test is determined 


both by the absolute and relative properties of the stimuli, 


The degree 


to which these properties control behavior depends on factors such as 
background characteristics and instructions. 


The purpose of this note is threefold. 
First Zeiler's (1963b) account of trans- 
position after intermediate size training 
in terms of adaptation level (AL) will 
be compared with an interpretation sug- 
gested by Stevenson and  Bitterman 
(1955). Second, the authors will com- 
ment briefly on Zeiler's data and their 
relevance to both positions. Finally some 
further research on the intermediate size 
problem which was designed to replicate 
and extend Zeiler's work will be pre- 
sented. 

Zeiler's (1963b) extension of AL 
theory differs from James' (1953) similar 
extension mainly in his assumption that 
the cue to which the subject (S) learns 
to respond during training is the ratio 
of the value of the correct training stimu- 
lus to the training AL. According to 
this hypothesis, transposition occurs be- 
cause as soon as S sees the test stimuli 
his AL changes, and he then chooses 
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that stimulus whose ratio to the new AL 
is nearest the ratio of the original posi- 
tive stimulus to the training AL. By 
the assignment of appropriate weights to 
the training stimuli and the test stimuli, 
Zeiler is able to account for a variety of 
results from previous intermediate size 
experiments. In addition to rationalizing 
previous work, Zeiler (1963a, 1963b) has 
presented a large amount of new informa- 
tion which he argues js consistent with 
the AL interpretation, but not with pre- 
vious analyses of transposition, 


A Comparison or Two INTERPRETATIONS 


Stevenson and Bitterman (1955) in- 
terpreted the results of their experiment 
on transposition following intermediate 
size training as evidence that Ss learned 
to respond to both relational and absolute 
properties of the stimuli, Although Ss 
showed relational learning by transpos- 
ing in the near test, the observed de- 
cline in transposition in a far test was 
interpreted as the consequence of the 
large difference between the absolute val- 
ues of the stimuli in training and testing. 
Since Stevenson and Bitterman's report, 
a number of other papers in the last few 
years have identified situational factors 
that influence the degree to which $5 
respond to relational or absolute proper- 
ties of the stimuli, For instance, Baker 
and Lawrence (1951) have shown the 
importance of temporal factors, and Riley: 
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Goggin, and Wright (1963) have shown 
the importance of variations of the spa- 
tial arrangements of the stimuli. 
Stevenson and Bitterman’s position ac- 
counts for the amount and direction of 
transfer in terms of (@) what is asso- 
ciated during training, and (b) the simi- 
larity of training and test situations. This 
position differs from the AL account 
primarily in one way. It does mot as- 
sume that transposition (or any other 
outcome of a transfer test) occurs because 
the condition of S changes while he is 
looking at the stimuli prior to making 
a choice. The AL account of transposi- 
tion, as we understand it, attributes trans- 
position in the intermediate size problem 
to a change in the perceived size of the 
stimuli, Stevenson and Bitterman's posi- 
tion denies neither the existence of such 
Sensory-adaptation processes nor that un- 
der very special circumstances such shifts 
in the state of S might produce transpo- 
sition. It does assert that its two as- 
sumptions are both necessary and suffi- 
cient for dealing with the facts revealed 
by transfer tests after training on the 
intermediate size problem and that the 
additional assumption of a change in 
adaptation level is superfluous. 


ZeEILER’s DATA 


Zeiler's (1963b) article considers two 
kinds of evidence. One is the evidence 
from previous intermediate size experi- 
ments, Zeiler suggests an AL interpreta- 
tion that is based on an ad-hoc weighting 
of the stimuli presented during training 
and testing. Such ad-hoc capabilities are, 
of course, only minimally convincing evi- 
dence for the validity of a theory. 

More interesting are the results from 
Zeiler’s own intermediate size experi- 
ments. His stimuli were flat, }-inch 
Masonite squares with indentations on 
the underside to hide the reward object. 
The smallest, Stimulus 1, was 4 square 
inches, and the squares increased in area 
by a factor of 1.4 to the largest, Stimulus 
8, which was 42.2 square inches. These 
stimuli were presented on a 20 X 30-inch 
tray of heavy cardboard. A 16 x 20.5- 
inch cardboard screen was used to hide 
the stimuli between trials. In his first set 
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of experiments, Zeiler reports two sorts 
of data: (a) those in which the training 
and test stimuli overlap at two points 
(e.g. 1-2-3:2-3-4), and (b) those in 
which the stimuli overlap at one point 
(1-2-3: 3-4-5) or not at all (1-2-3: 4-5-6). 
In the former experiments Ss transpose 
when tested, while in the latter they 
choose the stimulus which is closest in 
value to the originally correct. For these 
experiments Zeiler is able to predict the 
test results by using the same weights 
that he had previously used to account 
for Stevenson and Bitterman's (1955) 
data. Though such an achievement 
seems impressive, Zeiler's experiments 
are sufficiently similar to Stevenson and 
Bitterman's original ones that Stevenson 
and Bitterman's interpretation in terms of 
relative and absolute aspects of the stimu- 
lus situation remains equally tenable. 

In his next set of 17 experiments Zeiler 
increases the size ratio, but the general 
nature of the problems remains the same. 
Only 7 of these experiments with large 
ratios between stimuli (e.g., 1-3-5: 3-5-7) 
involve a prediction based on AL theory 
which is pitted against the prediction 
that S will choose the stimulus which was 
correct during training. In 6 of the 7, 
the AL predictions—based on the pre- 
vious weighting—fail, and only 1 is con- 
firmed. In all 6 failures Ss select the 
originally correct stimulus. 

Zeiler suggests that in determining AL 
the training stimuli should perhaps be 
weighted more heavily when they are 
very discriminable from each other, but 
he gives no rationale. Furthermore, ir 
some of these cases the discriminability 
of the fest stimuli is also much greater 
than in his early experiments. In any 
event, the simplest explanation of these 
data is that the response is under the con- 
trol of the positive training stimulus and 
not of a shifting AL. 

Zeiler (1963a) reports another sort of 
evidence which he interprets in AL terms. 
Ss were trained with an asymmetrical 
array of stimuli and tested with an over- 
lapping symmetrical array (e.g., 1-2-8: 1- 
2-3). The training stimuli were alwavs 

3 A wide range of weights will do equally 
well. 
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-2-8, and the test stimuli for different 
€ were 1-2-6, 1-2-5, 1-2-4, and 
1-2-3, Ss tested with 1-2-6 overwhelm- 
ingly chose the originally correct Stimu- 
lus 2, but as the largest test stimulus 
decreased in size there was an increasing 
tendency to select Stimulus 1 Although 
this last result is in accord with the AL 
hypothesis, the results for all four con- 
ditions considered together are not. 
Zeiler suggests that perhaps asymmetrical 
conditions require different weights, but 
again the rationale is not clear. 
Although the shift to Stimulus 1 is not 
predictable from Stevenson and Bitter- 
man’s (1955) hypothesis, an ex-post- 
facto interpretation is obvious. The pres- 
entation of a test set, 1-2-3, after training 
on 1-2-8 leaves one of the relationships 
undisturbed (i.e., 1-2), while another one 
(2-3 as opposed to 2-8) is radically 
changed. If during training S has 
learned, among other things, to choose a 
stimulus which is a particular amount 
smaller than the largest, then the substi- 
tution of 3 for 8 should force S’s choice 
toward 1. In short, these new phe- 
nomena are of real interest, but there is 
nothing in them that contradicts the as- 
sumption that S responds to both the 
absolute and the relational properties of 
the stimuli with the relative importance 
of these two factors determined by a 
variety of conditions during training and 
testing. But the fact remains that the 


TABLE 1 


NuMBER OF SUBJECTS CHOOSING SMALL (1), 
INTERMEDIATE (2), AND LARGE (3) 
Test STIMULUS AFTER TRAINING 
WITH STIMULI 1, 2, AND 8 


Test choice 


Condition —— pue 
N S I n 
Zeiler's experiment 30| 16 | 10| 4 
Replication of Zeiler's 
experiment 28| 0 | 17 | 11 
Instructions to attend 
to all 3 stimuli 13 0 9 4 
Nonrandom training 
arrays 11 2 6 3 
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shift from the intermediate stimulus to 
the smallest as the value of the largest 
stimulus declines is a dramatic effect pre- 
dicted by the AL position and not merely 
rationalized after the fact. 


Some New EMPIRICAL DATA 


In view of the importance of this 
switch in preference the authors decided 
to repeat the most extreme condition of 
Zeiler’s experiment—that is, the group 
trained with 1-2-8 and tested with 1-2-3. 
It was intended to repeat Zeiler’s basic 
phenomenon and then to ask whether 
variations in training would affect the 
response in a way consistent with Steven- 
son and Bitterman’s position. Our repe- 
tition of Zeiler’s 1-2-8:1-2-3 condition 
with 28 4- and 5-year-old Ss from day- 
care centers in Oakland, California, pro- 
duced results shown in Row 2 of Table 
1. Zeiler's results are shown in Row 1 
for comparison. Zeiler’s specifications 
for random spatial arrangements, place- 
ment of stimuli on the tray, correction 
procedures, and criterion (5 consecutive 
correct) were followed exactly. Despite 
the fact that Zeiler’s Procedure was dupli- 
cated as it is described and similar learn- 
ing performance was observed, the au- 
thors were unable to obtain Zeiler’s 
results in the test, and indeed the trend 
was in the opposite direction. 

i Zeiler (1963b) observes that the shift 
is most likely to occur if S attends to all 
three stimuli. Results in Row 3 are for 
13 Ss who were carefully instructed to 
look at all three stimuli before choosing. 
No appreciable change in behavior oc- 
curred. In a further effort to force S to 
attend to the difference between 2 and 8 
in training, the sequence of stimulus ar- 
rangements was changed from one in 
which the various spatial arrangements 
occurred in random order to one in which 
2 and 8 were always contiguous, Row ' 
shows a possible slight effect from this 
change, but nothing resembling Zeilers 
results. The authors have no satisfactory 
explanation of the inability to repe? 
Zeiler's results, zl 

While the failure to repeat this critic? 

result is unquestionably embarrassing t° 


"M 
| m 
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the AL interpretation, the failure to find 
an appreciable effect from attempts to 
ensure S’s learning of the 2-8 difference 
is also embarrassing for the position that 
transfer behavior is determined by the 
degree to which the training situation em- 
phasizes absolute or relational cues. In 
an effort to show that what is learned dur- 
ing training determines the transfer be- 
havior in this situation, the authors re- 
turned to a conventional, symmetrical 
middle-size problem in which training 
was on 1-2-3 and testing was on 2-3-4. 

P Gilinsky’s (1955) work suggests that 
instructions can be used to affect the 
probability of relative and absolute re- 
sponding, and Rock and Ebenholtz’ (1959) 
study suggests that the relation between 
the background and the training stimuli 
is also a potent determiner. Consequently 
these two variables were selected for 
study. 

After some trial and error, the authors 
realized that successful following of abso- 
lute instructions required that the back- 
ground be particularly salient. To achieve 
this the background board on which the 
training and testing stimuli were dis- 
played was modified by attaching three 
strips of white 4-inch Masonite 6 inches 
apart. These strips, which ran from the 
bottom of the board (near S) to the top, 
were 2.37 inches wide—the same width 
as Stimulus 2. 

On each training trial and on the test 
trial the three stimuli were placed on the 
3 strips (random-placement orders). 
Stimulus 1 was smaller than the strip, 
Stimulus 3 was wider, but two sides of 
Stimulus 2 were always flush with the 
edges of the strip. During training, 
Stimulus 2 was reinforced. All Ss were 
instructed to: “Find the right one—the 
one with the star under it.” After the 
common initial instructions, Ss were as- 
signed to one of three different groups 
which received additional instructions. 
Group I (Relative) received additional 
instructions to: “Find the middle-sized 
one’; Group II (Absolute,) was told: 
“See this one? The star will always be 
hidden under this one”; Group TII(Ab- 
solute,) was told: “Find the one that fits 
the board." These special instructions 
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TABLE 2 


NUMBER or SUBJECTS CHOOSING SMALL, 
INTERMEDIATE, AND LARGE TEST 
STIMULUS AFTER TRAINING WITH 

STIMULI 1, 2, AND 3 


Test choice 


Instructional group 
N S I 


I (Relative) 18 5 11 
II (Absolute;) 18 13 5 
III (Absolute;) | 18 17 1 


oom|tc 


Note.—Test stimuli are 2, 3, and 4. 


were not repeated on every trial during 
learning, and they were never repeated 
on the test trials. 

The results for these three group 
shown in Table 2 demonstrate very 
clearly that Ss' behavior in a transfer 
test after training on a middle-size prob- 
lem can be controlled by selection of the 
proper combination of background and 
instructions (x? — 18.23, df —2, p «01; 
in computing x? Columns 2 and 3 in Table 
2 were combined). Relative instructions 
produce relative responding. Absolute in- 
structions produce absolute responding. 
And absolute instructions that provide an 
explicit means of executing them (Group 
III) are almost completely effective. 


SUMMARY AND CONCLUSIONS 


Zeiler’s AL account of earlier transfer 
results following intermediate size train- 
ing is plausible, but because of its ad-hoc 
character it is no more conclusive than 
Stevenson and Bitterman’s earlier ex- 
planation in terms of the relative impor- 
tance of absolute and relational cues. 
Zeiler’s own new evidence is at least as 
inconclusive: In six of seven transfer 
tests which permitted Ss a choice between 
the stimulus predicted by AL theory and 
the one that was correct during training, 
Ss chose the latter. In addition, the au- 
thors have been unable to obtain the 
change in test-stimulus preference fol- 
lowing asymmetrical training that is pre- 
dicted by AL theory. Our own data show 
that manipulation of both instructions and 
background have pronounced effects upon 
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transfer behavior following intermediate 
size training. These effects, which can 
be attributed to the degree to which the 
training and testing situations emphasize 
the absolute or relational aspects of the 
stimuli, cannot be handled by current AL 
theories. 
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THE STIMULUS IN THE INTERMEDIATE SIZE PROBLEM 
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Riley, Sherman, and McKee maintained that a 2-process theory based 
on the learning of the absolute and middle size aspects of the stimulus 
was a better explanation of the intermediate size problem than the ratio 
theory. Various aspects of this contention were examined. The ratio 
theory appeared to be a more precise formulation than the 2-process 
alternative. In addition, no experiments with Ss below the level of 
adult humans that have used one training set have revealed that the 
middle size relationship can control behavior. Although most of the 
data were readily deduced by the ratio theory, there were results that 
required the assumption of precise absolute perception rather than the 
learning of ratios. Some alternative interpretations of the ratio theory 
were discussed, and a hypothesis based on 2 types of absolute learning 


was presented. 


As an alternative to the ratio theory 
(Zeiler, 1963b), Riley, Sherman, and 
McKee (1966) offered a two-process hy- 
pothesis based on the learning of absolute 
and relational stimulus properties to ex- 
plain the intermediate size problem 
(ISP). Coordinated with their explana- 
tion was the assertion that situational 
variables are important determinants of 
what is learned. The ratio theory, the 
two-process hypothesis, and the role of 
situational variables will each be dis- 
cussed. 


" 
Tur RATIO THEORY 


The ratio theory states that what is 
learned in the ISP is the perceived size 
rather than the relational aspect (middle 
sizedness) of the positive stimulus. Ac- 
cording to this theory, the perceived size 
of a stimulus is not defined solely by its 
absolute dimensions, but rather is de- 
scribed in terms of Helson's (1964) 
adaptation-level (AL) theory. This defi- 
nition of the appearance of a stimulus as 
due to its relationship to the prevailing 
frame of reference or AL reveals the am- 
biguity in the way that the term "rela- 
tional" has been used. Although the ratio 
theory asserts that relational principles 


eful to Jerome Wodinsky for 


1T am grat M f 
4 ding and discussion of earlier 


his critical rea 
drafts of this paper. 


are important because the perception of 
the size of a single stimulus depends on 
the other elements with which it appears, 
there is no reference to the relational 
concept of middle size. These two rela- 
tional definitions of the stimulus have dis- 
tinctly different meanings ; the first deals 
with the specific stimulus as a percept, 
while the second pertains to abstract prop- 
erties (“larger than,” “smaller than,” 
“middle sized”) that transcend particular 
stimuli. 

While Riley et al. (1966) asserted that 
the middle size relationship was learned 
in the ISP, they did not deny the im- 
portance of either the perceived size of 
the stimulus independent of its middle 
size quality or the role of frames of refer- 
ence in size perception. What they did 
question was the concept used in the ratio 
theory that the frame of reference is 
changed by both the training and the test 
stimuli; they believe that such adaptation 
processes would occur only under very 
special circumstances. The idea of shift- 
ing levels is an integral part of AL theory 
that has been used by Helson (1964) to 
describe behavior in a wide variety of 
situations and was not a principle in- 
vented to explain the ISP. Whether or 
not such processes are the ordinary or 
the special case in the ISP or in any 
other area of psychology is a question 
to be settled by an experimental analysis 
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rather than by argument. Pending such 
an analysis and in the absence of specific 
reasons for the rejection of this principle, 
it is difficult to understand why Riley et 
al. objected to this aspect of the ratio 
theory. E 

At any rate, the conception of percep- 
tion as due to a shifting frame of refer- 
ence does not exhaust the possibilities of 
what the ratio theory means. For ex- 
ample, suppose the subject (S) learns 
to choose a stimulus of a certain size in 
training and will continue to choose that 
same stimulus in the test. When trans- 
position occurs, it is because S perceives 
no differences between the middle-sized 
stimuli of the two sets and is, therefore, 
responding to the "same" stimulus. The 
ratio theory shows how large a change is 
required in order that the middle size 
stimuli be just noticeably different from 
each other. That is, when the middle- 
Sized member of the test set has the ratio 
most like that of the positive training 
stimulus, the two stimuli are perceived 
as identical; when another test-set mem- 
ber has the ratio closest to the positive 
training ratio, that element is perceived 
as being the same size as the positive 
training stimulus, and the middle-sized 
stimuli are not equivalent. The ratio 
theory, then, is primarily a way of talking 
about difference thresholds in the ISP 
rather than a statement about shifting 
perceptions. 

Still another alternative is that the 
formulas used in the theory provide noth- 
ing more than a convenient and simple 
model to explain or to describe choice. 
The psychological meaning of this model 
is a separate issue. 

In addition to accounting for the data 
of the ISP and predicting some new phe- 
nomena, the ratio theory has been speci- 
fic enough so that its predictions could 
be tested and found wrong. So far, when 
it has been incorrect, explanations have 
been used that were compatible with the 
basic assumptions. In contrast, in the 
absence of statements that predict when 
either absolute or middle size learning 
will occur and will be demonstrated in 
the test, the two-process theory is ad hoc 
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from the outset and has the dubious virtue 
of being impossible to prove wrong. 


RELATIONAL AND ABSOLUTE LEARNING 


Riley et al. asserted that the phenome- 
non reported by Zeiler (1963a) of the 
switch from a previously positive (Stim- 
ulus 2) to a previously negative (Stimu- 
lus 1) stimulus when the old positive 
stimulus was present in its original rela- 
tionship (Train 1-2-8, Test 1-2-3) could 
be explained by the two-process hypothe- 
sis. They stated that S learned the nature 
and the magnitude of both the 1-2 and the 
2-8 subrelationships in the training set 
and chose Stimulus 1 in the test because 
the 1-2 difference was held constant while 
the other pair was drastically altered (2-8 
to 2-3). But why should S not choose 
either Stimulus 2 or 3 rather than Stimu- 
lus 1 if the 1-2 relationship played an im- 
portant role in the learning of the training 
phase? Riley et al. maintained that such 
a subrelationship was learned in the train- 
ing discrimination, but discounted its ef- 
fect when discussing the choice in the 
test; in these terms, a switch to Stimulus 
1 means that what was learned about the 
1-2 relationship was either ignored or 
reversed on the test trial. Further re- 
search on the role of stimulus differences 
and subrelationships within a three- 
stimulus asymmetrical set is required be- 
fore this explanation of the data can be 
entertained as an alternative to the ratio 
theory. 

Setting aside considerations at the level 
of the relative explanatory and predic- 
tive merits of the two-process and the ra- 
tio theories, there is the question of a 
firm empirical basis for the two-process 
hypothesis. For the theory of Riley et al. 
to be plausible, there should be evidence 
that both relational and absolute learn- 
ing occur in the ISP. The main problem 
in this respect involves the existence of 
relational learning in the sense that it is 
used in the two-process theory; that is, 
the idea that the abstract property of mid- 
dle sizedness is learned, 

Since transposition is deduced with- 
out reference to the learning of the mid- 
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dle size concept with the ratio theory, the 
mere occurrence of transposition does 
not prove that this relational principle 
is involved in the solution of the prob- 
lem. What, then, is evidence that the 
middle size concept is important? A con- 
vincing demonstration would be a uni- 
form gradient of perfect transposition 
with test sets of various degrees of dif- 
ference from the training set. Such be- 
havior, which would be best explained by 
reference to the middle size relationship, 
has never been obtained in the ISP. 
With the notable exception discussed be- 
low, transposition occurred only when the 
training and the test sets were very 
similar to each other. This is precisely 
what the ratio theory predicts. 

Evidence for the learning of both the 
absolute and the middle size aspects of 
the positive training stimulus was pro- 
vided by a study with adult humans 
(Zeiler, 1964) in which test choices were 
on either basis with any test set. On the 
first test trial, however, adults transposed 
when the training and the test set were 
very different from each other and re- 
sponded to the previously reinforced ab- 
solute component when the two sets were 
similar. Here, in a study that revealed 
the simultaneous existence of both types 
of learning, the relational cue was used 
only when there was a large change be- 
tween the sets. Since the adult behavior 
was opposite to that of children and in- 


frahuman species who transposed with 
similar sets and chose the stimulus most 
aining 


similar in size to the positive tr 
stimulus with more different combina- 
tions (see Zeiler, 1963b, for a review of 
the literature), there must be some dif- 
ferences in the underlying processes. Al- 
though the two-process hypothesis is not 
a compelling general explanation of the 
ISP since two processes are not needed to 
explain the test choices of children and 
infrahuman species, it has value when re- 
served to describe the behavior of adults. 


SITUATIONAL VARIABLES 


The author is in complete agreement 
with an emphasis on situational factors 


as determinants of what is learned in 
the ISP. But the following brief review 
shows that the effect of every situational 
variable that has been evaluated with a 
range of test sets was to change the 
amount of absolute learning that occur- 
red. There is no evidence that any 
variable, with the possible exception of 
multiple training sets, displayed learning 
of the middle size relationship. 

Recent findings revealed that behavior 
can be controlled completely by the abso- 
lute physical size of the positive training 
stimulus. Human adults given a suc- 
cessive discrimination (Zeiler, 1964) and 
pigeons given a simultaneous discrimina- 
tion (Zeiler, 1965) always chose the test 
stimulus most similar in size to the posi- 
tive training stimulus. Asymmetrical sets 
and sets composed of stimuli that were 
quite different from each other produced 
more absolute transfer than did those 
that were either more symmetrical or 
had smaller intraset differences (Zeiler, 
1963b; Zeiler & Gardner, 1966). 

While Riley et al. showed that instruc- 
tions increased absolute learning (Group 
II), the choice behavior of the “relational 
instructions” group (Group I) was like 
that reported by Zeiler (1963b). Since 
in my experiment the same combination 
of sets was used but no special instruc- 
s were given, the relational instruc- 
m to alter the way in 
learned without 


tion: 
tions did not see 
which the problem was 
such directions. 

Rudel (1957) found that a delay be- 
tween the training and the testing phases 
n transposition with a test set 
te choice occurred in an 
The author has initiated 
research to determine whether the effects 
after a delay are due to middle size learn- 
ing or occur for some other reason. Tf 
the middle size cue controls behavior sev- 
eral hours after learning, transposition 
should occur with test sets of various de- 
grees of similarity to the training set. Tf, 
however, Rudel's (1957) data were due to 
the greater difficulty of differentiating the 
test from the training set after a delay, 
the specific test set with which response 
to the large or small stimulus occurs 


resulted i 
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immediate test. 


260 


might be further removed from the train- 
ing set than when testing is immediate. 
In the absence of such information, no 
conclusions about the role of delay are 
ible. 
uo et al. showed that a relevant back- 
ground stimulus together with suitable 
instructions was another variable that 
produced almost perfect absolute learning. 
But the meaning of these results is am- 
biguous, as having Ss match the positive 
stimulus to the background may have 
created a matching-to-sample situation 
which tapped different processes than 
does the ISP. 

Certain combinations of two training 
sets did produce uniform transposition 
(Gonzalez & Ross, 1958; Zeiler & Paal- 
berg, 1964). Since the use of two train- 
ing sets in which the only common rein- 
forced element is a certain relationship 
may extinguish the use of other cues, 
these experiments may have involved 
processes that do not play a part in the 
ordinary ISP. That learning of the mid- 
dle size relationship can be demonstrated 
at all indicates that the potential for such 
learning exists, but it does not show that 
this potentiality is used in the discrimina- 
tion of a single set. 


Wnar ts LEARNED? 


The absence of evidence for learning 
of the middle size relationship in any but 
the study with adults and the multiple 
training set experiments prevents the two- 
process theory from being a convincing 
explanation of the ISP. But, because of 
the growing catalog of situational vari- 
ables that cause absolute learning, the 
ratio theory is also unconvincing, 

The postulation of different kinds of ab- 
solute learning can account for all of the 
data. To the extent that contextual fac- 
tors alter the perceived stimulus size, per- 
ception is not of the precise qualities of 
the physical stimulus. Such an impre- 
cise perception of the stimulus size fits 

the ratio-theory model and explains trans- 
position with similar training and test 
sets and absolute or random choice with 
less similar sets. Another type of abso- 
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lute learning, resulting in absolute choices 
with all of the test sets, involves the exact 
perception of the individual stimulus size 
independent of and unaffected by changes 
in the context. These two types of per- 
ception, rather than being entirely dif- 
ferent from each other, represent degrees 
of absolute perception; phylogenetic level 
and situational factors determine the point 
on the continuum of the precision of ab- 
solute learning that controls behavior in 
a given case. 

The failure of Riley et al. to replicate 
Zeiler's (1963a) finding can be under- 
stood as a function of differences in ab- 
solute learning. My experiment indi- 
cated that the learning fit the ratio theory, 
while Riley et al. reported behavior that 
suggested a more precise perception of 
the size of the positive training stimulus. 
A few years ago, I reported difficulties in 
replicating this experiment and sug- 
gested that attentional factors might ex- 
plain the discrepancies (Zeiler, 1963b). 
The isolation of the variables that oper- 
ated in each of these studies to determine 
the type of learning that occurred would 
represent a substantial achievement. 
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In a recent theoretical note, Sines (1964) has characterized old- 
strategy test validation as of the form: “If B (a particular type or 
class of behavior), then P (a particular test response or pattern).” 
He offers a new strategy of the form: “If P, then B.” The present 
note shows that the old strategy (by using a control group) is really 
of the form: "P if, and only if, B” The new strategy is really of 
the form: “B if, and only if, P" and hence logically equivalent. The 
choice of strategy must be dictated by E’s goals and hypotheses. 


Sines (1964) has suggested a new 
“strategy” for the validation of psycho- 
logical tests. He offers this new strategy 
as a substitute for and an improvement on 
the earlier methods. The old strategy, 
according to Sines, begins with some be- 
havior (B) and attempts to find that test 
pattern (P) which could be inferred from 
that behavior. Sines symbolizes the logic 
of this approach as follows: “If B, then 
P.” In the new strategy, we begin with 
P and attempt to find that B which can 
be inferred from it (in symbols: “If B; 
then B"). Sines is, of course, correct 
when he asserts that these two symbolic 
Statements are not logically equivalent. 

In favor of his new strategy, Sines 
makes two points. The first is that the 
new strategy is more like what clinicians 
really do (this accounts, he says, for why 
they are successful in real life, but not in 
validation studies). The second is that 
in fact a given B may manifest itself in 
several different Ps (the B to P mapping 
may be one to many), and this would 
make the search for the unique P futile. 

The purpose of the present note is to 
set the analysis offered by Sines into a 
slightly larger context. The point to be 
made is that the new strategy is in fact 
logically equivalent to the old and that 
the choice of strategy should depend upon 
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the purposes of the experimenter or upon 
his theoretical expectations, 


LOGICAL CONSIDERATIONS 


Let us begin with a closer analysis of 
the old strategy. The original derivation 
of the MMPI scales (e.g., Hathaway & 
McKinley, 1940) should serve as a good 
case study. The assumption behind this 
research was that there is a unique test 
pattern associated with a Particular diag- 
nostic category. This is clearly the old 
Strategy (“If B, the behavior of the given 
diagnostic category, then P, the test pat- 
tern unique to the category”). Sines’ fear 
that there may be several Ps for the cate- 
gory is not a problem here because if on 
a given statement some Hysterics, say, 
respond “true,” and some respond “false,” 
this item will not appear unique to B, 
The real worry here is that someone who 
is not Hysteric will turn up with a high 
Hy score. 

The old strategists were aware of this 
danger, and to avoid it they used a con- 
trol group. They were, after all, in 
search of items unique to the diagnostic 
category, and so they chose as items for 
the Hy scale those which significantly 
Separated the Hysterics from their control 
subjects. In symbols, they tried to show: 
(a) “If B, then P"; and (b) "If non-B, 
then non-P." Given (a) and (5), then it 
is true that: “If P, then B" the form of 
Sines’ “new” strategy. 

If a criticism can be leveled against the 
logic of these experiments, it is in the fail- 
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ure to have composed the control group of 
all possible non-B. If this had been done, 
then no item could have appeared on more 
than one scale. It is plain, however, that 
the answer to this criticism is not the 
strategy suggested by Sines, namely, 
checking to see whether Hysterics also 
show other patterns. 

Let us now examine closely the new 
strategy as exemplified by the model study 
reported by Sines. In this study, the 
records of patients exhibiting a certain 
MMPI profile (actually having profiles 
within an arbitrary distance of the 4'3 
pattern) were checked for the behaviors 
reported in their case histories. If Sines 
were interested only in making an as- 
sertion of the form: “If P, then B," it 
would be sufficient merely to find a single 
behavior reported in all of the 4'3 pa- 
tients' case folders. In actual practice, 
however, Sines employed a control group, 
and he looked for that behavioral item 
Which those patients with a 4'3 pattern 
had more than those in the control group. 

In symbols, Sines tries to show: (a) 
"If P, then B"; and (b) "If non-P, then 
non-B.” Given a and b, then it follows 
that: “If B, then P”—the form Sines 
regards as old strategy. 

The logical equivaler 
Strategies may also be A 
noting that in Sines’ study, the assign- 
ment of the factors P and B is completely 
arbitrary, The MMPI “test pattern” is 
as much behavior as is any other response 
or set of responses. Similarly, it does not 
strain the concept of test to regard case 
histories as tests. If we make this reas- 
signment of letters, it is clear that this 
Study is identical in form to Hathaway 
and McKinley’s (1940) study- 

The above analyses show that the old 
strategy by the use of a control group 
actually sought to show «P if, and only if, 
B” Similarly, through the use of a con- 
trol group, the new strategy 1S of the 
form “B if, and only i5 mm Logically, 
these two assertions are equivalent. The 
strategies differ only in where they begin: 
the old strategy beginning with some be- 
havior for which a unique test pattern 1s 
sought, the new strategy beginning with 


nce of the two 
demonstrated by 
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a test pattern for which a unique be- 
havior is sought. 


EMPIRICAL CONSIDERATIONS 


To illustrate the superiority of the new 
strategy, Sines conducted a second study 
which presumably shows how an old 
strategist would have conducted his first 
study. The results were negative: Only 
1 person of 31 who were judged assaultive 
did have the 43 MMPI pattern, This 
finding is not proof that the two strategies 
yield different results. Since we have 
shown that they are logically equivalent, 
this second study is simply a contra- 
dictory finding. 

There are several aspects of Sines' pro- 
cedures which might account for contra- 
dictory results. First, Study I was never 
cross-validated. Second, it should be 
noted that Study I did not show "If non- 
P, then non-B" exactly (since 40% of the 
non-P showed B); therefore, the first or 
the second study may be a chance finding. 
Third, there were several other Bs (see 
Sines, 1964, p. 521, Table 1) which should 
have been investigated also. Fourth, 
there is some question concerning the test 
for B in the two studies. In Study II, 
ns were selected who had either 
“threatened to kill or had seriously as- 
saulted someone." In Study I, however, 
the criterion was "history of hostile or 
aggressive behavior." It should be noted 
that none of the 4/3 patients who met 
that criterion was “felt to be dangerous" 
or “had a violent temper"; whereas six 
of the non-4’3 patients (control group) 
were felt to be dangerous, and three were 
judged to have a violent temper. This 
latter point raises the general question of 
the adequacy of a social history for this 
kind of research. 

In further support of the superiority of 
the new over the old strategy, Sines 
presents a hypothetical example. In this 
example a particular B, assaultive be- 
havior, manifests itself in several differ- 
ent test patterns. We are given the 
following invariances: Of 100 persons 
showing B, 40 have test pattern Py, and 
15 each have test patterns Ps, Ps, Pa, and 
P, We have somehow isolated only test 
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ern P, and are interested in finding 
e relation of this pattern to B. Sines 
contends that if such a relation existed, 
the old strategy would fail to reveal it. 
This is not correct, however. 

Using the old technique, we would col- 
lect cases of B and examine the tests of 
these individuals for the presence of Po. 
We would also randomly select a control 
group of non-B cases from the same 
population. If we collected 100 cases of 
each, they would fall into four discrete 
groups: B, Ps; B, non-Ps; non-B, P»; 
non-B, non-Ps. According to the condi- 
tions of the problem, the numbers in each 
of these classifications would be, respec- 
tively, 15, 85, 0, 100. A chi-square on 
these data woud be equal to 16.2, sig- 
nificant beyond the .001 level, revealing 
a definite relation between B and Ps. 

It must be admitted that the new 
strategy which would have had us collect 
cases of Py and non-P and examine these 
for B would have yielded a more efficient 
result since the number of cases required 
would have been smaller. However, 
rarely does one wish merely to relate one 
test pattern to one behavior. More fre- 
quently the attempt is made to relate one 
(or several) behavior(s) to several test 
patterns. In the model study reported by 
Sines, several behaviors were to be re- 
lated to the one test pattern. 


CHOICE or STRATEGY 


Since the two strategies are logically 
equivalent and should produce the same 
empirical results, the choice of strategy 
must depend on the needs of the experi- 
menter. The test maker in his thinking 
must begin with B. He wants by defini- 
tion to create a situation (the test) in 
which B will manifest itself either di- 
rectly or indirectly. P is the indicator, 
thesign of B. His rationale explains why 
this relation exists. A "P, B" relation 
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which just exists leaves a theoretical gap 
in his work. 

For all the examples given by Sines, 
the MMPI, Rorschach, Thematic Ap- 
perception Test, and Draw-A-Person 
Test, rationales exist. Sines’ own MMPI 
study begins with a pattern which was 
evolved from old-strategy thinking. Given 
just the responses to the 566-item MMPI, 
it is difficult to imagine how one would 
begin to isolate patterns which would be 
relevant to socially significant behavior. 
Were this even possible, the question of 
“why” would nag and demand attention. 

Once a test has been devised (and per- 
haps only then), it may be possible for 
a user to develop a hypothesis for which 
the new strategy would be appropriate. 

Furthermore, it is unlikely that the test 
users, the experts mentioned by Sines who 
can infer a personality structure from a 
Rorschach or MMPI profile, do so by 
sorting mentally through all the profiles 
they have seen. More likely, their think- 
ing represents a combining of the various 
behavior tendencies which they see in- 
dicated by the elements of the profile. 

Sines’ analysis should serve to alert 
test validators to the possibilities of a 
given behavior pattern manifesting itself 
in several test patterns. The other side 
of the coin should not be overlooked, to 
wit, that a single test pattern may indi- 
cate several kinds of behaviors. The 
choice of strategy must be dictated by the 
experimenter’s goals and hypotheses. 
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Schizophrenics tend to exhibit d 


ferent areas. This paper attempts to a 
ation and conceptualization disturbances. 
and regression in terms of a common proc 


ization of response hierarchies. 
process, these different areas 
have similarities in terms O 
and conditions under which 
formal account which involves 
arousal on competing responses, 
response-strength ceiling, is used 
pected across content areas. The 
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different kinds of schizophrenic ab 


This paper is concerned with the 
question, “Why do different behaviors 
Which are thought of as ‘schizophrenic’ 
go together?" Our purpose is to pre- 
Sent a possible explanation for the clus- 
tering of some schizophrenic behaviors, 
basing the explanation on a process 
which is seen as underlying different 
areas of schizophrenic behavior. The 
discussion is not intended as a defini- 
tive account of all behaviors which oc- 
Cur in persons diagnosed as schizo- 
phrenic, partly because much of what 
will be said requires additional support, 
and partly because our goal is to em- 
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eviant behavior in a number of dif- 
ccount for schizophrenic associ- 
, delusions, hallucinations, 
ess involving partial random- 
Because of a common underlying 
hizophrenic behavior are expected to 
er from normal behavior, 
ll be observed. A 
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a direct result of increased arousal 
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tendencies evoked in a situation. 
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conceptualization which have been con- 
sidered basic or of major importance 
in schizophrenia (Bleuler, 1950; Cam- 
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phasize defensive behaviors which serve 
an anxiety-reducing function. There 
is little doubt that defensive behaviors 
are used by schizophrenics. However, 
defensive behaviors will be seen as 
probable, but not necessary, correlates 
of the process which is the focus of 
this paper. This focus follows two 
complementary lines of thought: (a) 
that behavioral deficit in schizophrenia 
is the result of arousal or anxiety (see 
literature reviewed in Distler, May, & 
Tuma, 1964; Lynn, 1963; Mednick, 
1958), and (b) that behavioral deficit 
may be related to arousal apart from 
possible defensive or avoidance func- 
tions of the behavior (Fish, 1961; 
Freeman, 1960; Malmo, 1959). 

The basic theoretical account of the 
arousal-related process to be used here 
has been presented previously (Broen 
& Storms, 1961). In this account, a 
stimulus situation evokes habitual 
tendencies to respond which vary in 
strength depending on amount of pre- 
vious training. Habit strength inter- 
acts multiplicatively with arousal 
(drive) level to yield response strength. 
When more than one tendency to re- 
spond is evoked in a situation, the 
probability of the dominant response 
is an increasing function of the dif- 
ferences in response strengths between 
dominant and competing responses. A 
response-strength ceiling which is lower 
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than the product of maximum arousal 
level and maximum habit strength is 
also postulated. In this account, if 
arousal level and habit strength of the 
dominant response are sufficient to 
raise the strength of the dominant re- 
sponse to its ceiling, further increases 
in arousal can only facilitate competing 
responses. The result is a decrease in 
the relative dominance of the dominant 
response, with increased probability of 
competing responses. 

The basic relationships in the ac- 
count are illustrated in Figure 1. Note 
the figure on the left first. This figure 
is intended to illustrate alternate re- 
sponses which are evoked in different 
strengths (e.g., the dominant re- 
sponse “light” and competing response 
“night” evoked by the stimulus word 
DARK in a word-association task). 
When arousal level is increased beyond 
the level where the strength of the 
dominant response reaches ceiling, the 
hierarchical order of dominant and 
competing responses tends to break 
down. With increasing arousal, the 
tendency is for alternate responses to 
become more equal in strength and 
probability of occurrence (Arousal 
Level 3 in Figure 1). 

Normal, appropriate behavior is 
characterized by dominance of appro- 
priate responses. Thus in an appropri- 
ately responding person, an increase in 
arousal which decreases the frequency 
of dominant responses usually reduces 
appropriateness of behavior. This de- 
creased appropriateness will be of a 
particular type. With decreased hier- 
archial dominance of appropriate over 
competing responses, behavior will be 
characterized by increased fluctuation 
between appropriate and competing re- 
sponses. This type of disorganization 
of normal response hierarchies seems 
to be common to each of the content 


areas of schizophrenic behavior we 
will discuss. 
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As is illustrated in Figure 1, note 
that the extent of hierarchical disorgan- 
ization is a function of three variables: 
(a) the strength of dominant and com- 
peting responses, (b) the level of 
arousal, and (c) the level of response- 
strength ceiling. If response strengths 
of dominant and competing responses 
are low, an increase in arousal may 
increase the dominance of the appro- 
priate response (as in the increase in 
arousal from 1 to 1.5 in the left figure). 
Also, if competing responses are not 
present, there is no decrement in the 
relative strength of the appropriate 
response regardless of arousal level. 
The effect of individual differences in 
ceiling level can be seen by comparing 
the left and right figures. A low ceil- 
ing leads to disorganization of a re- 
sponse hierarchy under lower levels of 
arousal. It is important to note that 
because of differences in level of ceiling 
we expect the same type of disorganize 
behavior to occur at different arousal 
levels in different persons. While we 
expect similar changes in behavior to 
occur across individuals as a function 
of intraindividual variations in arousal, 
a specific level of arousal will not be 
assumed to be related to a specific 
amount of disorganization of response 
hierarchies. Not all schizophrenics ex- 
hibiting the type of behavior to be dis- 
Cussed here will be expected to be 
characterized by high levels of arousal. 
. Some additional comments will aid 
in translating the account into expert 
mental predictions. The term arousal 
1s used to indicate a dimension of rela- 
tively diffuse cortical and physiological 
activation which may be measured by 
Changes in peripheral physiological in- 
dicants such as increases in heart rate, 
blood pressure, and respiration rate. 
Extended discussion concerning the 
Concept of arousal or activation as we 
use the terms can be found in Carlton 
(1963), Gellhorn (1964), Malmo 


(1959), Pinneo (1961), and Schnore 
(1959). The studies we will cite as 
having varied arousal experimentally 
use independent variables which are 
similar to variables which have been 
shown to lead to the general physio- 
logical changes indicative of arousal. 
A particularly good example of such 
a variable is induced muscular tension 
(Pinneo, 1961). Variation in anxiety 
or psychological stress is also seen as 
related to arousal variation (Schnore, 
1959; Spence, 1958b), although cau- 
tion is needed in the interpretation of 
studies using specific methods of vary- 
ing anxiety which have not been 
related to physiological indicants of 
arousal. 

We have used three measures of 
habit strength in making predictions 
for our own research and in interpret- 
ing other research. In experiments in 
which new S-R connections are 
trained, the habit strength of a response 
is assumed to be an increasing func- 
tion of the number of occurrences of 
that response during training. Habit 
strength is also assumed to vary in- 
versely with the amount a stimulus 
differs from a training stimulus when 
the subject is familiar with the stimulus 
alternatives in the experiment. How- 
ever, these definitions are of little use 
in many of the studies we will cite 
se these studies deal with res- 


becau ; 3 
sponses with unknown learning his- 
tories. In studies using stimuli and 


ch are commonly used, 
associations to familiar 
11 assume that the habit 
ominant response is 
at its maximum, and the average habit 
strength of a competing response for 
groups of schizophrenics 1s a function 
of the probability of that response as 
observed in normals. A 

Response-strength ceiling effects in 
experiments are predicted on the basis 
of two assumptions. First, schizo- 


responses whi 
for example, 
words, we wi 
strength of the d 
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phrenics are assumed to have lower 
average response-strength ceilings than 
nonschizophrenics. As a result, the 
response hierarchies of schizophrenics 
should become disorganized at lower 
levels of arousal. This assumption is 
consistent with data suggesting that 
normals may operate efficiently within 
a broader range of arousal than schizo- 
phrenics (Lynn, 1963; Venables, 
1960). Second, we will assume that, in 
a group of persons who exhibit schizo- 
phrenic behavior in hospital environ- 
ments, the stress of standard experi- 
mental situations is sufficient to lead to 
ceiling effects when familiar associa- 
tions are tested. With this level of 
arousal and lower ceilings, schizo- 
phrenics should exhibit more of the 
type of disorganization expected from 
response-strength ceiling effects than 
nonschizophrenics. 

The implications of the present ac- 
count overlap in some respects hy- 
potheses proposed by Chapman (1958), 
Payne (1961), Meehl (1962), espe- 
cially in his second neurological model, 
Weckowicz and Blewett (1959), and 
Stilson and Kopell (1964) in their 
neurophysiological thinking. The ac- 
count also has a degree of similarity 
to Russian thinking concerning a re- 
duction in internal inhibition in some 
schizophrenics which would selectively 
facilitate nonreinforced responses and 
responses to weak stimuli (Lynn, 
1963). There is also a great deal of 
similarity to the neurophysiological 
theory of schizophrenia proposed by 
Fish (1961). 

Because of common background in 
extensions or modifications of con- 
cepts from Hullian learning theory, 
this account is also similar in some 
respects to an account of inappropriate 
behavior in terms of competing re- 
sponses raised above threshold by in- 
creased drive (Spence, 1956, 19582; 

Taylor & Spence, 1952). This thresh- 
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old theory was used by Mednick 
(1958) in a very stimulating account 
of some aspects of schizophrenia. 
However, the accounts differ on the 
basic point of how increased arousal 
leads to inappropriate behavior and 
thus also differ in predictions regarding 
schizophrenic behavior (for discussions 
of the accounts see Broen & Storms, 
1961; Broen, Storms, & Schenck, 
1961; DeMille, 1959; Hill, 1957). 

It may be useful to note a basic 
difference between the ceiling account 
and the threshold account. When 
response-strength threshold factors, 
rather than ceiling effects, are seen as 
leading to decrement in a dominant 
response, increased arousal is expected 
to lead to decrement only when re- 
sponse strength is low. In the usual 
situation where a response which is 
appropriate has higher habit strength 
than a competing response, threshold 
effects would lead to the following : 


. at very low levels of drive and training 
a relatively weak-drive group would be 
expected to perform at a higher level than 
a somewhat stronger-drive group. On the 
other hand at higher levels of training and 
drive the reverse picture would hold; that 
is, the performance of the higher-drive group 
should be above that of the relatively weaker- 
drive group [Spence, 1958a, p. 92]. 


In contrast, response-strength ceiling 
effects would lead to dominant-response 
decrement primarily under higher 
levels of drive and higher levels of 
strength of dominant and competing 
responses. The following discussion 
of some aspects of schizophrenic be- 
havior is based on consideration of 
response-strength ceiling effects. 


ASSOCIATION DISTURBANCE 


Both clinical and research literature 
suggest that schizophrenics have differ- 
ent associations in their thought proc- 
esses or, on a more overt level, different 
verbal associations from normals or 
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other psychiatric patients (Deering, 
1963; Sommer, Dewar, & Osmond, 
1960; Storms & Broen, 1964). Look- 
ing at the particular nature of this 
difference, it seems that the schizo- 
phrenic's associations are not discon- 
nected, highly irrelevant thoughts. In- 
stead, schizophrenic associations are 
characterized as “unprecise approxima- 
tions” which “cannot be dismissed as 
either ‘irrelevant’ or ‘incoherent’ for 
they are certainly about the general 
subject and loosely related to one an- 
other [Cameron, 1944, pp. 53-54].” 
Bleuler (1950) described the deviant 
but related associations another way, 
characterizing ^ some schizophrenic 
thinking as “ideas of a certain category 

. . thrown into one pot, mixed, and 
subsequently picked out at random [p- 
16]." As a more specific example of 
Schizophrenic “looseness of associa- 
tion,” Arieti (1955, p. 4) considers 
the schizophrenic disorder to be implicit 
in the answer, “White House,” to the 
question, “Who was the first president 
of the United States?” _ Burstein’s 
(1961) study comparing schizophrenics 
and normals on selecting synonyms 
from a list of words also suggests that 
the primary verbal difference between 
schizophrenics and normals is the 
Schizophrenics’ overuse of associations 
which are inappropriate but related to 
the stimulus, such as antonyms an 
homonyms. x 

. This particular form of association 
disturbance seems to be consistent with 
the kind of behavior expected under 
high arousal within the account used 
in this paper. As has been noted, an 
increase in arousal beyond the level 
Necessary to raise a dominant response 
to maximum strength will tend tO 
equalize the response strengths of domi- 
nant and competing responses. Thus, 
the expected form of disturbed behavior 
Would be characterized by increase 
randomization among the responses in 
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response hierarchies (Bleuler's, 1950, 
“ideas in a certain category") with a 
consequent increase in the probability of 
those behaviors which are at relatively 
low strength in normal response hier- 
archies. The increase in within-hier- 
archy, associated-but-deviant responses 
seems much like the descriptions of the 
characteristics of associative disturb- 
ance in schizophrenia mentioned above. 
If schizophrenic associations can be 
rimarily understood as associations 
under high arousal, the formal account 
has other implications in addition to the 
expectation that deviant responses will 
be loosely relevant. The partial equali- 
zation of response strengths of alter- 
nate responses under increased arousal 
also implies an instability of associa- 
tions. In addition, in schizophrenics, 
instability of associations should in- 
crease with increased arousal level. 
Greater instability of associative re- 
sponses in schizophrenics than in nor- 
mals was found on repeated association 
tasks by Sommer et al. (1960) and 
Storms and Broen (1964). This asso- 
ciative instability in schizophrenics 
ease under stress con- 


does seem to incr s 
ditions, such as time stress and in re- 


sponse to words rated as anxiety pro- 
ducing, and the increase In instability 
under stress tends to be greater m 
schizophrenics than in neurotics and 
normals (Storms & Broen, 1964). 
The latter result was interpreted as 


due to schizophrenics having 


being Zop 
greater arousal reactivity to stress con- 
ditions, oF having lower response- 


and therefore being 
show the response- 
strength ceiling effects discussed above. 

As was noted previously, the account 
used here also implies that schizo- 
hrenic association disturbance should 
þe greatest on tasks where competing 
responses are present and have rela- 
tively high habit strength. This im- 
plication is consistent with data in- 


strength ceilings, 
more likely to 
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iv ormals, 
dicating that, cow ls a accent 
schizophrenics - a ‘he ett ci wonls 
o tasks Siu Fed than on tasks 
with mpe dom word (Faibish, 
mi m another relevant study, 
Blaufarb (1962) compared schizo- 
phrenics and normals on the pompe 
associated with proverbs, with the am- 
piguity of the proverbs varied by two 
methods of presentation (i.e., present- 
ing proverbs singly, and presenting 
three proverbs having the same mean- 
ing in a group, with the group method 
assumed to provide a more precise 
stimulus which would reduce the num- 
ber of inadequate associations). The 
performance of schizophrenics was sig- 
nificantly poorer than that of the nor- 
mals only under the more ambiguous 
method of presenting proverbs, again 
suggesting that schizophrenics’ associa- 
tions are most deviant on tasks where 
competing associations are present. 
Paired-associate learning tasks have 

also been used to investigate the hy- 
pothesis that schizophrenics are espe- 
cially deficient on tasks where compet- 
ing responses are present in strength, 
The results are inconsistent but ap- 
parently tend to support the hypothesis 
(Kausler, Lair, & Matsumoto, 1964: 
Spence & Lair, 1965, support the 
hypothesis, while Spence & Lair, 1964, 
report negative results). However, 
it is difficult to interpret the results 
of these studies in terms of the 
relative decrement in dominant re- 
sponse expected from the present ac. 
count. This is because during training 
the correct response is not dominant at 
first and then becomes dominant, but 
the data are overall measures of per- 
formance which do not take variation 
in dominance into account. 

In summary, there is some Support 
for the hypothesis that schizophrenic 
associations may be influenced by the 
type of disorganization of response 
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hierarchies expected from ceiling ef- 
fects: (a) Schizophrenic deviant asso- 
ciations tend to be loosely associated 
with the stimulus; (b) schizophrenic 
response hierarchies tend to be un- 
stable, with this instability increasing 
under stress; and (c) schizophrenic 
association deficits tend to occur pri- 


marily in situations which elicit multi- 
ple associations. 


Concert DISTURBANCE 


Certain changes in conceptual re- 
sponses are also expected as a result of 
increased arousal. By “conceptual 
responses” we mean responses which 
are appropriate only to specific classes 
of objects (e.g., using the label “birds” 
only to designate birds). The same 
task and arousal conditions which were 
relevant to the occurrence of “looseness 
of association” are reley: 


: ant to concept 
disturbance. 


When appropriate con- 
ceptual responses are dominant and 
competing resp 

high strength, incre 
increase the frequenc 
conceptual responses, As an example 
consider Aes Which has been used 
in research on concept disturb 
(Chapman, 1961), am is us 
cards on which the names of different 
objects are printed, and he is instructed 
to select the cards Which refer to 
birds, j cards designate 
flying insects, , When the subject looks 
the response which is e bye 


appropriate to 
will b 


Ppropriate 
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inclusive use of a conceptual response 
which may result from increased 
arousal. 

Increased arousal may also result in 
a too narrow or overexclusive use of a 
conceptual response. Some objects 
which should be included in a class of 
objects may be similar in some respects 
to objects which do not belong in a 
class. Such objects would evoke com- 
peting tendencies not to include them 
in their appropriate category. With 
sufficient increase in arousal, these 
competing tendencies toward inappro- 
priate exclusion would be exacer- 
bated. Note that overexclusive and 
overinclusive responding are not mu- 
tually incompatible. If competing 
response tendencies to include certain 
inappropriate objects in a class and to 
exclude certain appropriate objects are 
both present, increased arousal may 
increase both types of conceptual errors. 

Deviant breadth of concepts is a 
commonly noted aspect of schizo- 
phrenic behavior. Consistent with the 
above expectations, schizophrenic con- 
cept performance has been character- 
ized both as overexclusive (“concrete”) 
and overinclusive (see literature re- 
viewed by Chapman, 1961; Payne, 
1961). Usually studies have found 
schizophrenic concepts to be either 
overexclusive or overinclusive, with 
more recent studies emphasizing over- 
inclusion. A few studies have found 
schizophrenic conceptual performance 
to be both overexclusive and overinclu- 
sive (Chapman, 1961; Chapman & 
Taylor, 1957; Zaslow, 1950). In 
some contrast to the rationale of the 
present discussion it has been suggested 
that type of conceptual error in schizo- 
phrenics is a function of type of 
subjects, with organic pathology in 
“process” schizophrenics leading to 
overexclusive concepts (e.g., Tutko & 
Spence, 1962). Subject and task 
variables may both be important in 
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TABLE 1 


MEAN ERRORS ON CONCEPT-SORTING TASKS 
FOR SCHIZOPHRENICS AND NORMALS 


Overinclusion errors | Overexclusion errors 


Task 
Normal] Schizophrenic | Normal] Schizophrenic 


D] oam 1.61 1.52 2.61 
2 | 83 2.08 | 2.94 8.06 
3| 42 6.85 94 2.40 
4 | 2.12 10.57 61 1.39 


Note.—Adapted from Chapman (1961). 


determining type of conceptual disturb- 
ance. Cortical damage in some schizo- 
phrenics may lead to overexclusion, 
while an arousal process in others leads 
to overexclusion or overinclusion, de- 
pending on task conditions. 

Within the present account, increased 
arousal will increase a particular kind 
of error only in a task which evokes 
a tendency toward that error in some 
strength. To the extent that differ- 
ences between schizophrenics and nor- 
mals are due to the arousal process, 
this means that schizophrenics should 
make the same types of conceptual 
errors as normals, except that schizo- 
phrenics should make more of these 
errors. Also because exacerbation of 
errors under arousal depends in part 
on the strength of error tendency 
evoked by a task, the frequency of 
errors across tasks should tend to be 
correlated for schizophrenics and nor- 
mals. Evidence on the above points is 
provided in the study we have used to 
provide a research example (Chapman, 
1961). Chapman used four kinds of 
conceptual card-sorting tasks and re- 
corded both overinclusive and overex- 
clusive errors. Relevant data from 
Chapman's study are presented in 
Table 1. 

Note several things in Table 1. 
First, both overinclusion and over- 
exclusion errors are made by schizo- 
phrenics and normals, with schizophre- 
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nics making more of both types of 
errors. Also, in three of the four 
tasks the dominant error direction 
(overinclusion versus overexclusion) 
is the same for both schizophrenics and 
normals. In addition, note that for 
overinclusion and overexclusion errors 
the error frequencies for schizophrenics 
and normals have 1.0 rank-order cor- 
relation across tasks. It may also be 
of interest to note that in a different 
content area, syllogistic reasoning, 
Gottesman and Chapman (1960) found 
that hierarchies of reasoning errors 
within tasks were similar for schizo- 
phrenics and normals, the difference 
consisting of a partial randomization of 
response hierarchies in the schizophre- 
nics. As expected from the paradigm 
used here, the frequency of a particular 
type of conceptual error does seem to 
be a function of the strength of that 
error tendency in normal response 
hierarchies. The sugestion that schizo- 
phrenic error patterns seem to be 
exacerbations of normal error patterns 
(with supporting evidence) has also 
been made in other papers by Chapman 
(1958) and by Fey (1951). 
We have emphasized the relationship 
of schizophrenic conceptual errors to 
the strength of error tendencies in nor- 
mal response hierarchies. In the 
paradigm being used here, the strength 
of an appropriate dominant response 
also plays a rolein determining whether 
or not competing inappropriate re- 
sponses will increase under increased 
arousal. As was seen in the discussion 
of Figure 1, response-strength ceiling 
effects leading to an increase in com- 
peting responses are most likely when 
dominant response strength is high, 
Concept research with schizophrenics 
has usually not emphasized strength of 
appropriate conceptual responses as an 
independent variable. The stimuli 
have tended to be familiar, supposedly 
eliciting appropriate responses of high 
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strength. Stimulus-generalization stud- 
ies may provide relevant evidence con- 
cerning variation in strength of domi- 
nant responses as related to errors. 
We have some hesitation about citing 
stimulus-generalization studies in the 
present context because concepts are 
usually based on mediated generaliza- 
tion instead of primary stimulus gener- 
alization. However, as suggestive evi- 
dence, note that in stimulus-generaliza- 
tion tasks, schizophrenics as compared 
to normals do tend to exhibit reduced 
frequency of a dominant response pri- 
marily where the response is strong— 
at the training stimulus (Broen & 
Storms, 1964; Dunn, 1954; Garmezy, 
1952; Knopf & Fager, 1959). Recent 
evidence suggests that the same phe- 
nomena may occur in the area of medi- 
ated generalization (Chapman & Chap- 
man, 1965; Peastrel, 1964). In these 
studies, schizophrenic deficit in re- 
sponses which depended on mediated 
similarity occurred primarily to stimuli 
where mediated similarity was greatest. 

Consideration of the importance of 
task variables in determining type of 
concept disturbance in Schizophrenia 
suggests that the recent emphasis on 
schizophrenics' overinclusive concepts 
in the research literature may be a 
function of task conditions. We hy- 
pothesize that overinclusive and under- 
inclusive errors would occur with equal 
frequency in acute schizophrenics if 
task conditions were equated. Equat- 
ing task conditions would involve 
equating appropriate response strengths 
and competing response strengths for 
stimuli where inclusive and exclusive 
Tesponses are appropriate, and having 
equal numbers of both classes of stim- 
uli in the study (equal numbers of 
objects which would be correctly in- 
cluded and excluded). 

An additional point is that if schizo- 
phrenic conceptual errors are responses 
which are low in normal response 
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hierarchies, schizophrenic conceptual 
errors should be associated responses 
rather than irrelevant responses. Re- 
ferring back to the research example, 
this means that if there is overinclusion 
in selecting objects which are birds, 
the specific overinclusions would tend 
to be flying insects rather than irrele- 
vant objects such as articles of clothing. 
The research evidence indicates that 
schizophrenic conceptual errors do tend 
to be associated errors rather than ir- 
relevant errors (Chapman, 1958, 1961; 
Chapman & Taylor, 1957). 

The arousal paradigm also implies 
that schizophrenics should show more 
conceptual deficit under increased 
arousal (except when dominant appro- 
priate responses have low strength— 
see Figure l—or when an aversive 
stimulus is used specifically to provide 
new information or as an incentive— 
see Cavanaugh, 1958). The most 
relevant data we know on this point 
are again from stimulus-generalization 
studies. Broen, Storms, and Gold- 
berg (1963) tested the generalization 
of a motor response under different 
amounts of induced muscular tension 
(arousal). Increased arousal led to a 
decrease in the response at its training 
stimulus (overexclusion) and a facili- 
tation of the response to generalization 
stimuli (overinclusion). In another 
relevant study, Broen and Storms 
(1964) trained schizophrenics to press 
a lever in one direction in response to 
one category of pictures (tree with 
branch angles in a specified range) and 
to press the lever in the other direction 
in response to another category of pic- 
tures (different branch ^ angles). 
Lengthy training and probabilistic re- 
inforcement were used to insure that 
both appropriate and competing re- 
sponses had high habit strengths. An 
increase in induced muscular tension 
reduced the number of appropriate re- 
sponses and increased the number of 
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competing responses. In accordance 
with an hypothesis of higher response- 
strength ceilings in normals, hospital- 
ized nonpsychiatric subjects showed no 
significant decrement in appropriate 
responses or increase in competing re- 
sponses. These two studies are most 
directly relevant to the effect of arousal 
on conceptual responses when concepts 
are defined in terms of a range of stim- 
ulus similarity. In these studies the 
evidence is consistent with the implica- 
tion from the account that under certain 
boundary conditions increased arousal 
tends to increase competing conceptual 
responses at the expense of appropriate 
conceptual responses. We expect that 
increased arousal has similar effects on 
responses to other types of concepts, 
such as conceptual groupings deter- 
mined by functions of objects, but re- 
search which would confirm this gen- 
eralization is certainly needed. 


DELUSIONS 


We expect that transitory and rela- 
tively unstructured delusions will ac- 
company the types of association and 
concept disturbances we have been dis- 
cussing. When a person's categoriza- 
tions are underinclusive and overinclu- 
sive, the data which he sees as relevant 
to a problem will be deviant. In draw- 
ing conclusions some relevant data will 
not be used, and loosely associated data. 
which are normally excluded as unim- 
portant, will be seen as quite relevant. 
Delusional conclusions will result. 
These delusions will be transitory and 
unstructured because the underlying 
process involves fluctuations between 
appropriate and competing associations. 
It is important to note that increased 
arousal should lead to a syndrome 
which includes only transitory and un- 
structured delusions. Stable delusions 
may develop later ifa particular delu- 
sion reduces anxiety. Such stability of 
thought implies that the current level 
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of arousal is not sufficient to lead to 
response-strength ceiling effects. 
Cameron’s (1951) conclusions con- 
cerning delusions are quite similar to 
the implications of the present account. 
He distinguished two stages in the for- 
mation of a delusion—an initial diffuse 
and unstructured phase and a later re- 
strictive phase. The unstructured de- 
lusions are related to anxiety through 
a causal chain like that in the present 
account, namely the effect of anxiety 
in lowering the “relevance-to-irrele- 
vance” ratio of the information on 
which conclusions are based. The de- 
velopment of structure and stability 
would occur later if the delusional 
thinking comes to serve a defensive 
function. 

Relevant research information is 
sparse but supportive enough to sug- 
gest that further investigation based 
on the implications of the present ac- 
count may be useful As expected 
Írom the postulated relationship be- 
tween concept disturbance and delu- 
sions, recently admitted schizophrenics 
who have delusions are significantly 
more overinclusive in their thinking 
than either nondeluded schizophrenics 
or nonschizophrenic patients (Payne, 
Caird, & Laverty, 1964). This rela- 
tionship seems to be a function of 
chronicity and may be reversed in 
chronic deluded schizophrenics if delu- 
sions are the single prominent symptom 
as would be expected when a delusion 
serves a defensive function (Silverman, 


1964). 
HALLUCINATIONS 


An arousal-induced disorganization 
of response hierarchies may also under- 
lie the occurrence of some types of hal- 
lucinations. When internal or external 
cues evoke competing perceptual re- 
sponses, increased arousal may lead to 
a relative decrease in the dominance 
of appropriate perceptual responses, 
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thereby increasing inappropriate per- 
ceptions. 

The hallucinations which would re- 
sult from this breakdown in normal 
response hierarchies would tend to be 
disorganized and fragmented mixtures 
of appropriate and inappropriate per- 
ceptions. These hallucinations should 
also increase with increased arousal. 
In addition, because the hallucinatory 
perceptions are normally evoked as 
low-strength competing responses, the 
misperceptions should have some asso- 
ciation to the stimulus situation in 
which they occur. We will briefly 
discuss each of these implications in 
turn. 

The expectation that some schizo- 
phrenic hallucinations should be dis- 
organized mixtures of appropriate and 
inappropriate perceptions is different 
from the rather organized and mean- 
ingful reports of hallucinations which 
are usually presented in abnormal psy- 
chology texts. However, there is a 
fair amount of clinical evidence sug- 
gesting that schizophrenic hallucina- 
tions are often, even usually, such 
fragmented mixtures. For example, 
Kraepelin (1919) and Bleuler (1950) 
both report rather disorganized and 
fragmented content in schizophrenic 
hallucinations. Bleuler emphasizes the 
varied and abrupt nature of hallucina- 
tions and observes that sensory decep- 
tions may occur along with a correct 
observation of the same event, Dis- 
cussing hallucinated voices Bleuler 
(1950) states that: "As a rule. . . the 
patients hear short sentences or single 
words which in themselves need not 
always make sense. It is the patient 
who usually imputes some meaning to 
the words or sentences [p. 99]." This 
meaningful organization of mispercep- 
tions may be a secondary process which 
is learned because it is anxiety reduc- 
ing. As Kraepelin (1919) notes: 
“They [hallucinations] are almost never 
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wanting in the acute and subacute 


forms of the disease . . . at the begin- 
ning these are usually simple noises 
. and then develops . . . the hearing 


of voices [p. 7].” 

The hypothesized relationship be- 
tween increased arousal and hallucina- 
tions seems at first to be contrary to 
the discussions which have emphasized 
the causal role of low arousal, primarily 
being concerned with dreams during 
sleep (see Evarts, 1962). However, 
Harris (1959) has emphasized the 
point that this low-arousal process is 
different from the process leading to 
hallucinations in schizophrenia, a sug- 
gestion which is supported by physio- 
logical differences between dreaming 
and schizophrenia (Rechtschaffen, 
Schulsinger, & Mednick, 1964). Other 
writers point out that high levels of 
arousal or anxiety may also be asso- 
ciated with hallucinations (Fish, 1961; 
West, 1962; Zuckerman, Albright, 
Marks, & Miller, 1962). A high- 
arousal explanation of schizophrenic 
hallucinations also seems consistent 
with differences in frequency of differ- 
ent types of hallucinations. For ex- 
ample, hallucinations seem to have 
more stressful than nonstressful con- 
tent (e.g., accusing voices). Stressful 
thoughts would be expected to lead to 
higher levels of arousal than nonstress- 
ful content, increasing the likelihood of 
lack of discrimination of the source of 
the stimulation. We might also con- 
sider why unusual perceptions in 
schizophrenia are so frequently con- 
cerned with body sensations (note the 
MMPI Sc scale content) and are more 
frequently auditory than visual ( Malitz, 
Wilkens, & Esecover, 1962). The ex- 
planation may lie in different arousal- 
producing properties in different sen- 
sory modalities. If a particular kind of 
stimulation leads to more arousal, these 
stimuli will be more usually misper- 
ceived. In line with the dominance 


of body and auditory hallucinations in 
schizophrenia, Bernhaut, Gellhorn, and 
Rasmussen (1953, p. 33) state that 
“|. sensory stimuli listed in decreas- 
ing order of ability to produce an 
arousal reaction are: (i) pain (most 
effective) ; (ii) proprioception; (iii) 
auditory stimulation; (iv) optic stimu- 
lation (least effective)." 

The suggestion that hallucinatory 
perceptions should be perceptions 
which are associated with the situa- 
tion in which they occur also seems 
consistent with clinical observations 
and the results of a few relevant experi- 
ments. The association is most obvious 
when the misperception is simply an 
inaccurate localization of stimulation. 
For example, patients’ own thoughts 
appear to be spoken out loud, and 
actual external sounds may be per- 
ceived as coming from within (Bleuler, 
1950). The view that hallucinations 
are associated responses also receives 
some experimental support from stud- 
ies indicating that auditory hallucina- 
tions are more likely in the presence of 
audible sound (Harris, 1959; Riss, 
1959), and from Lindsley's (1963) 
finding that "in general, the more simi- 
lar the stimulus was to a human voice, 
the higher the probability that a given 
patient would ‘talk to it’ [p. 295]." 


REGRESSION 


In normal adults, those responses 
which are dominant in a situation tend 
to be appropriate to that situation. 
Competing responses may also be 
evoked at some strengths. These com- 
peting responses may be of at least two 
types: (a) responses which had been 
learned earlier but are no longer ap- 
propriate, and (b) responses which are 
generalized from similar situations. 
Increased arousal which is sufficient to 
equalize dominant and competing re- 
sponses at ceiling strengths will there- 
fore increase the proportion of re- 
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sponses which had been learned earlier 
in the situation (or related situations). 
In the sense that the probability of old 
responses is increased, regression is 
expected as one aspect of the syndrome 
of behaviors expected under high levels 
of arousal. However, the expected 
behavior is not simply a return to child- 
like patterns of behaving. The form of 
regression would be a relatively un- 
structured mixture of appropriate, 
generalized, and regressive responses. 

There is some evidence that schizo- 

phrenic regression is not a simple re- 
turn to childlike behavior patterns (see 
literature reviewed in Chapman, Bur- 
stein, Day, & Verdone, 1961), and that 
the lack of subordination of responses 
which are normally more hierarchically 
ordered is the form of regression 
which occurs in schizophrenia (Cam- 
eron, 1939). | Cameron's conclusions 
from an object-sorting task he used to 
study regression provide a good de- 
scription of the type of regression 
which occurs in schizophrenia. He ob- 
served that the schizophrenics were 
". . . unable to organize or subordi- 
nate the events occurring simultane- 
ously in their organism— perceptual, 
memorial, imaginal [p. 269]." While 
such behavior may have some childlike 
qualities, it would seem to be more 
aptly described as a decrement in the 
normal hierarchical ordering of alter- 
nate response tendencies. 

Much of the research suggesting that 
regression can be induced by increased 
arousal has been done with infrahuman 
subjects (see Kleemeier, 1942). How- 
ever, there is considerable clinical lit- 
erature and some experimental evi- 
dence suggesting that increased Stress 
or arousal is related to regression in 
humans (e.g., Barker, Dembo, & Lewin, 
1941; Levin, 1965; Noyes & Kolb, 
1958; Rodnick, 1942; Sadler, 1953). 

Levin's (1965) study provides an 
especially rigorous test of a relation- 
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Ship between increased arousal and 
regression implied by the paradigm 
used here. In this study, normal 
subjects first learned to associate num- 
bers to colors. In a second phase the 
correct associations were changed, and 
associations were tested under different 
degrees of arousal (varied by changing 
degree of induced muscular tension). 
At the time that arousal was varied, the 
responses which had previously been 
Correct (regressive responses) were the 
Strongest competing responses. The 
arousal paradigm implies that when 
arousal increases the strength of com- 
peting responses, the multiplicative ef- 
fect of arousal should favor the strong- 
est competing response (Levin’s re- 
gression response) to the point where 
this response is restricted by the re- 
sponse-strength ceiling. Arousal in- 
creased beyond this level would tend 
to increase the proportion of other, 
initially weaker, competing responses. 
In the Levin study when proportion of 
regression errors to other errors was 
related to arousal level, the inverted U 
relationship implied by the present ac- 
count was found. The Proportion of 
errors which were regressive responses 
increased as muscle tension increased 
from zero to low to medium tension, 
and then decreased under high tension, 

Another implication from the theo- 
retical account used in the present 
paper is that regression is more likely 
to occur if both presently dominant 
and older responses have high response 
Strength. Clinical literature and re- 
search reports (using animal subjects) 
have concentrated on only part of this 
implication, being concerned with the 
strength of the older or regressive re- 
Sponse as a determiner of regression. 
Most authors have agreed that there 
1s a positive relation between strength 
of the former response and likelihood 
Of regression (eg., Fenichel, 1945; 
Freud, 1949; Martin, 1940). 


TA 
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Thus there is some support for the 
hypothesis that a form of regression, 
the destructured mixture of mature, 
generalized, and previously appropriate 
behaviors which seems to occur in 
schizophrenia, may be the result of the 
same arousal-related process which de- 
termines other aspects of a schizophre- 
nic syndrome. 
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CUTANEOUS COLOR SENSITIVITY: 
EXPLANATION AND DEMONSTRATION 
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Past reports that humans can, in com 
fingers the colors objects would have 
ably been received with skepticism. 


A previously 
based on differential rate of 


absorption of infrared 


plete darkness, sense with their 
if illuminated, have understand- 
Proposed hypothesis, 


layers of the skin, is inconsisten 
situation. Quantitative anal 
temperature surface 
ever, leads to the c 


by the associated effects on sk 
demonstrations show that this is 


measurements confirm the theoretical 


Occasionally over the years popular 
accounts and clinical reports have 
appeared that describe a rather in- 
credible capacity of some individuals 
to sense with their fingers the prop- 
erties of objects that normally only 
the eyes can detect. Recent investi- 
gations, beginning with those of 
Goldberg (1963), Youtz (1963, 1964), 
and Nyuberg (1964), have increased 
the creditability of such reports to the 
point where they must be investigated 
to explain their apparent incompati- 
bility with the present concepts of 
sensory function. 

Some of these reports are probably 
explained by the inadequacy of blind- 
folds for preventing use of the eyes 
(see Gardner, 1966). Among these is 
Nyuberg's (1964) report on an in- 
dividual who identified printed char- 
acters and demonstrated trichromatic 
color sensitivity while blindfolded. 
In all respects, the subject's perform- 
ance was remarkably like what one 
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ysis of the system 
juxtaposed to the higher te 
onclusion that | 
different room-temperature surfaces 


radiation by different 
t with the thermodynamics of the 
consisting of a room- 
mperature skin, how- 
arge differences in emissivity of 
almost certainly could be detected 
temperature. Easily repeatable 
e, and a few simple temperature 
identification of the mechanism. 


in 
tru 


would expect if it were mei 
images reaching the retin 
the pinhole apertures tha 
exist at the edges of blindfolds 
(Gardner, 1966), Goldberg (1963) 
reported earlier tests on the same 
individual conditions 


diated by 
a through 
t typically 


required in an 


unlikely phenomena, 
In such cases, where the results seem 


very unlikely, and especially in cases 
where a small number of individuals 
claim an extraordinary ability, rather 
extensive precautions against trickery 
may be required as is advised by 
Gardner (1966). In all cases in which 
light is intended as a cutaneous stim- 
ulus (e.g., in Leont'ev's work de- 
scribed by Pick, 1964, p. 22), inter- 
pretation of the results must take 
into account the thermal effects of 
light on skin; “heat filters’ do not 
eliminate these effects, for no filter 
can pass light and not Pass energy 


which can be converted to heat upon 
absorption. 


Youtz's (1963, 19 
are not so easily e 
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Xplained as those 
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mentioned ‘above. He tested an 
exceptional individual who was able 
to identify the color of paper, plastics, 
and cloth, which were in the complete 
darkness of the interior of a light-tight 
box. (Color is used here in the tech- 
nical sense, encompassing brightness 
as well as hue.) The subject was 
required to wear a blindfold and to 
rest her forehead against the box more 
than a foot above two holes in the 
side of the box, to which were attached 
thick, black sleeves with elastic wrist 
bands, through which the subject was 
required to thrust her hands in order 
to make the identifications. Four 
different series of experiments were 
conducted in this manner. The sub- 
ject was correct 85-95% of the time 
in the first series, much less often but 
still significantly above chance in the 
second series, and at chance level in 
the third and fourth series. 

Youtz (1964, 1965) also tested 133 
Barnard College freshmen under the 
following conditions: To each he 
attached a blindfold and also a "bib" 
stretching from around the subject's 
neck to the top of the frame holding 
the test objects, thus completely hid- 
ing the test objects from the subject. 
He reports that 10% of the subjects 
were able to perform under these 
conditions consistently (not Type I 
errors) above chance. To explain the 
performance of these subjects, Youtz 
has advanced what he calls the ‘‘wave- 
length-temperature discrimination hy- 
pothesis” (1964, 1965). 

Unfortunately, Youtz has not yet 
published a full account of this work 
so that his procedure and analysis can 
be carefully studied. Thus one could 
attribute the gradual loss of ability 
of his exceptional subject to learning 
on the part of the experimenter, and 
the probability statistics in his study 
of Barnard students are not reported 
in enough detail to make the data so 
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far reported thoroughly convincing. 
Yet, although the wavelength-tem- 
perature discrimination hypothesis is 
almost certainly incorrect, as shown 
below, the conclusion that humans 
can perform discriminations of the 
type described by Youtz almost 
certainly is correct. 

The purposes of this paper, then, 
are: 


1. To point out the flaws in the 
wavelength-temperature discrimina- 
tion hypothesis. 

2. To show that we should not be 
surprised if people can discriminate 
objects in the dark, which appear to 
differ only in color when illuminated ; 
however, certain thermodynamic con- 
siderations and established sensory 
capacities would lead us to be sur- 
prised if people could not make such 
discriminations. 

3. To describe a simple demonstra- 
tion whereby the skeptical reader can 
prove to himself his ability to make 
such discriminations. 


WAVELENGTH-TEMPERATURE 
DISCRIMINATION 
HYPOTHESIS 


The inevitable consequence of ab- 
sorption of radiant energy, whether in 
the visible spectrum or not, is an 
increase in temperature of the absorb- 
ing entity. We know that the human 
skin is exquisitely sensitive to tem- 
perature changes: Under some condi- 
tions a human subject can detect a 
cutaneous temperature change of 
.001?C sec! (Hardy & Oppel, 1937 ; 
Hendler & Hardy, 1960; Hendler, 
Hardy, & Murgatroyd, 1961; Hensel, 
1950). Thusitisof no special interest 
that a subject can detect the differ- 
ences caused by illumination of the 
skin by different light colors unless the 
difference in energy absorbed is 
smaller than is likely to be detected 
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by the resulting effects on skin tem- 
perature, a capacity that has not yet 
been convincingly demonstrated. 
Discrimination of objects that are 
neither illuminated nor self-luminous 
poses a different problem, however, 
for the way in which the object serves 
as an energy source may not be 
obvious. To account for discrimina- 
tion of such objects, Youtz (1964, 
1965) has advanced the following 
hypothesis. 
The human skin, a nearly ideal 
black-body radiator under ordinary 
conditions (Barnes, 1963; Hardy, 
1934b; Hardy & Muschenheim, 1934), 
radiates energy within the dark, 
light-tight box containing the test 
objects. The energy radiated by the 
skin is absorbed by the test objects 
and reemitted at wavelengths deter- 
mined by their respective composi- 
tions, particularly by any dyes or 
pigments present. The rays re- 
emitted from the test objects penetrate 
the skin to varying depths before 
being absorbed, depending upon their 
wavelengths (Hardy & Muschenheim, 
1934, 1936; Oppel & Hardy, 1937a, 
1937b). Localized heating then oc- 
curs at the depths at which the rays 
are absorbed, and the subject hy- 
pothetically discriminates the relative 
depths at which the heating occurs. 
Necessary for this hypothesis, then, 
are: (a) detectable radiation by the 
test objects in wavelengths which 
penetrate skin to different depths, 
and (b) the human ability to dis- 
criminate slight differences in tem- 
perature, or differences in temperature 
change, at different depths within the 
skin. . ' " 
This hypothetical explanation is 
plausible enough. The lack of in- 
dependent evidence that humans can 
make the necessary b discrimination 
is only a minor drawback. . It is 
shown below, however, that the hy- 


pothesis fails to satisfy condition a; 
for, under the conditions of Youtz's 
experiments, practically all of the 
energy is radiated by the test objects 
in wavelengths that are absorbed too 
close to the skin surface to provide 
depth cues. 


Thermodynamic Considerations 


The foregoing conclusion follows 
from certain thermodynamic con- 
siderations that hold for all but very 
special conditions such as are arti- 
ficially arranged within the laboratory 
or are inferred to exist in exotic places 
like the interiors of stars. Since the 
hypothesis concerns only radiant heat 
exchanges, discussion of the effects of 
conduction and convection will be 
deferred until later, The principal 
point here is that a substance of a 
given temperature, such as the sub- 
ject's skin or one of the test objects, 
cannot radiate more energy at any 
wavelength than does an ideal radiator 
at the same (or higher) temperature, 
unless it receives energy from a higher 
energy source. If the substance in the 
lower energy state (the test object) 
receives energy from the substance in 
the higher energy state (the skin) 
only through radiation, to which the 
Present case is limited, then the 
energy absorbed cannot be reemitted 
(fluoresced) at a shorter wavelength 
than the wavelength at which it was 
originally absorbed (Stokes' law). 
This means that so long as the test 
objects are cooler than the skin, they 
always return less energy to the skin 
in wavelengths shorter than any 
particular wavelength than the skin 
emits in those wavelengths ; this, then, 
places an upper limit on the amount 
by which test Objects can differ from 
one another in their radiant exchanges 
with the skin. These limits will now 
be taken up quantitatively. 


CUTANEOUS Cotor SENSITIVITY 


The proportion of incident radiation 
absorbed rather than reflected by an 
opaque object at a given wavelength 
is expressed by its spectral emissivity, 
€. A perfect reflector reflects all 
incident radiation and therefore ab- 
sorbs none (« = 0): An ideal black- 
body radiator absorbs all incident 
radiation (e, = 1.0), but it emits 
energy according to Planck's radiation 
formula: 


_ 0A 
T eeen]? 


Hy 


where M, = spectral radiant flux den- 
sity in w cm mp7, \ = wave- 
length in u, T = temperature in K°, 
€ = 37.403 w cmd m^, C2 
= 14387.9 p° K. 

The energy radiated by the skin at 
a given wavelength, Æ, equals the 
product of its spectral emissivity, e,, 
and the energy radiated at that wave- 
length and temperature by a black 
body, Hy: 


Ex, C1À TË 
Ih, = a Fy, = me =p 

Then the maximum amount of 
energy of wavelength \ that could be 
returned to the skin by a juxtaposed 
test object, if it were a perfect re- 
flector, is the total amount emitted 
by the skin, Ah, (To simplify 
geometric considerations, the distance 
of the test object from the skin is 
assumed to be small relative to its 
length and width.) And the least 
amount of energy of wavelength i 
that could be returned to the skin by 
a juxtaposed test object, if it were an 
ideal radiator, is E, of which e£, 
would be absorbed by the skin. 

The net energy loss of the skin over 
the interval, A = a to à = z, is the 
difference between the energy radiated 
and the energy absorbed over that 
interval. The least that loss can be, 
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when the skin is juxtaposed to a 
perfect reflector, is zero; and the 
greatest it can be, when juxtaposed 
to an ideal radiator, is: 


AH max = — [ 5o Í e, Hnd 
? eu dA ! ea dA 
v a eM] + a eM Lp 


Then the interval, AH max, is the 
maximum range over which the test 
objects can vary in their effects on the 
skin. From this equation it is clear 
that this range depends upon the 
spectral emissivity of the skin, 6e,; 
upon the temperature of the skin, 77; 
upon the temperature of the test 
objects, T»; and upon the spectral 
band considered, a < À < s. 


Computations 


Radiation in different wavelengths 
can be classified according to whether 
skin is relatively transparent to it, 
relatively opaque to it, or of inter- 
mediate density with respect to it. 
These three categories partition the 
relevant part of the spectrum into 
three regions (Hardy, Hammel, & 
Murgatroyd, 1956; Hardy & Musch- 
enheim, 1934, 1936; Hendler & Hardy, 
1960; Hendler et al., 1961; Oppel & 
Hardy, 1937a, 1937b): Wavelengths 
that are, respectively, shorter than 
.8 p, longer than 3 p, and between 
.8 pand 34. For expository purposes, 
and because Youtz has implicated 
these three regions in his hypothesis, 
each will be considered separately. 

Itis clear at the outset that prac- 
tically no energy is emitted by the 
skin in the wavelengths shorter than 


.8 u, that is, 
E 
AHmax = I e, Hd e 0, 
0 


for the eyes are so sensitive to these 
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wavelengths that we should otherwise 
be able to see one another in the dark. 
To bias the result in favor of the 
wavelength-temperature discrimina- 
tion hypothesis, let e, and T, assume 
their maximum_respective values of 
1.0 and 98.6°F and let, 


.8 
f e, Hndi = 0. 
Jo 
Then, 
8 
BET --f Hyd 
= —1.13(10-9) w cm-?, 


(The negative sign indicates that net 
energy flux is outward from the skin.) 
Since the energies of photons in these 
wavelengths exceed 2.4 A 10-39 y 
photon, the ratio, 


—1.13(1072) w cm-? 
2.4(107*) j/photon 


= 4.7 (107*) photon sec-! cm- 


ll 


1.7 photon hr-! cm-?, 


shows that a square centimeter of skin 
at body temperature loses, on the 
average, no more than two photons in 
these wavelengths every hour. 

Consider now radiation in the wave- 
lengths between .8y and 34, which 
penetrates to intermediate depths 
below the skin surface. The spectral 
emissivity of the human skin in 
these wavelengths is reported to be 
between .75 and .95 (Barnes, 1963. 
Hardy, 1934b; Hardy & Muschen- 
heim, 1934; Oppel & Hardy, 19374), 
but again to bias the result in favor 
of the wavelength-temperature dis- 
crimination hypothesis, let e, and 7, 
assume their respective maximum 
values of 1.0 and 98.6?F, and let 7, 


be a room temperature of 72°F. Then 
AF max 
3 (XA 3 TNT 5dA 
(00 Jo em] ur o eTe] 
= —7.5 (10-8) +3.3 (10-6) 


= —4.2(10-5) w cm-? 
= — 10^? g-cal sec-! cm-?, 


The threshold radiant flux density 
for large skin areas, Hasen, depends 
upon a number of variables, but in 
this situation it lies between 1.5 x 10-3 
and 7.0/X 10-4 g-cal sec! cm-? (Hardy 
& Oppel, 1937; Hendler & Hardv, 
1960; Hendler et al., 1961; Lele, Wed. 
dell, & Williams, 1954). This is 
greater by two orders of magnitude 
than AH, the maximum possible 
energy exchange that could take place 
between the skin anda room-temper- 
ature object via wavelengths shorter 
than 34. One might argue that less 
energy is required to reach threshold 
at these wavelengths than at the 
longer wavelengths used in arriving 
at the above value of Hos, but just 
the opposite appears to be the case 
(Oppel & Hardy, 1937a), 

. 5 ol there is some error 
in the conclusion that the signal 
afforded by 10-6 g-cal sec-i Ae is 


with the much larger 
energy exchanges occurring via the 
longer wavelengths A » 3). 
exchanges, 


AA max 


--— I Hyd + f, "Hyd 


= 9.3(10-%) w cm- 
= 2.2(1079) g-cal sec- cn, 


These 


are More than 2000 times those oc- 
curring in the shorter wavelengths just 


nm 
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discussed. Computations show that 
variations of -+.013°F in Ts or T, 
or variations of +.001 in emissivity 
either of the skin or of the test objects, 
would cause variations in the value of 
AF max for the wavelengths longer than 
3u, as large as the entire value of 
AHmax for the wavelengths shorter 
than 3p, 

Then no detectable radiant ex- 
change occurs via wavelengths shorter 
than 3 y, and if depth of penetration 
provides the subjects with any in- 
formation, it must occur via wave- 
lengths longer than 3p. But all 
wavelengths longer than 3» are ab- 
sorbed at approximately the same 
depth (Hardy & Muschenheim, 1934). 
Furthermore, 90% of this radiation is 
absorbed within the outermost 30 u 
or so of skin (Hardy & Muschenheim, 
1936), which is keratinized tissue de- 
void of nerve endings and receptors 
(Ranson & Clark, 1953). Little more 
than 1% of incident radiation in these 
wavelengths penetrates even so far as 
the most superficial skin layers sus- 
pected of harboring thermal receptors, 
more than 1004 below the surface 
(Hendler et al., 1961; Zotterman, 
1959). That radiation in these wave- 
lengths does not penetrate any signifi- 
cant distance is supported by the 
finding (Hendler et al., 1961) that 
blackening the skin surface, which 
insures complete absorption of inci- 
dent radiation at the very surface of 
the skin, has no effect upon thresholds 
to such radiation. 

To recapitulate, then, a necessary 
condition for the temperature-wave- 
length discrimination hypothesis is 
that the test objects emit detectable 
radiation at wavelengths that pene- 
trate to different depths within the 
skin. But, as just demonstrated, 
practically all of the energy is ex- 
changed via wavelengths that are 
absorbed at a uniform depth, and that 
depth is very close to the surface but 


distant from receptors. Thus, the 
necessary condition is not satisfied, 
and the wavelength-temperature dis- 
crimination hypothesis cannot hold. 


Mechanism 


Despite the flaws of the wave- 
length-temperature discrimination hy- 
pothesis, it has led to an understand- 
ing of the situation it was intended to 
explain. The total radiant heat 
exchange between the skin at body 
temperature and a black body at 
room temperature, 2.2 X 10-? g-cal 
sec^! cm™?, is anywhere from 3-15 
times threshold, and thus it is cer- 
tainly large enough to be detected. 
Objects having sufficiently different 
spectral emissivities in the infrared, 
when held close to the skin, should 
produce detectably different rates of 
skin cooling, so long as a sufficient 
temperature difference is maintained 
between them and the skin. That 
subjects use this temperature cue as 
the basis of their discriminations is 
supported by some of their subjective 
reports (Youtz, 1964). 

One can, in fact, compute the 
differences in emissivity that should 
be detectable on this basis with any 
given combination of temperatures. 
Let real substances, having spectral 
emissivity, e,, replace the ideal sub- 
stances in the equation for AHmax, and 
let a, = 1.0, a = 0, ands = ce, then 
the net energy loss by the skin is the 
difference between the total energy 
emitted by the skin and the sum of 
energy emitted and the energy re- 
flected by the substance, all of which 
is absorbed by the skin (because 
e, — 1.0). Thus, 


AIL us l H3,dÀ «f eH, dA 
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difference as a function of temperature. (The 
independent variable for the solid curves is 
skin temperature, and T, = 72°F; the in- 
dependent variable for the broken curves is 
temperature of the test object, and 7. 
= 98.6°F.) 


The value of AH obviously depends 
on the spectral absorption curve of the 
particular test object under consid- 
eration. Not the most interesting, 
but nevertheless an instructive case 
results if de,/dà = 0; that is, if e, is 
constant. This case is instructive 
because it provides a basis for estimat- 
ing the approximate mean differ- 
ence between spectral emissivities, 
weighted by Planck's radiation for- 
mula, that should be detectable under 
various circumstances. Let the dif- 
ference in emissivity of the two 
test objects be written Ae. Then 
threshold, 


Hinresn -f Ae (Hy, — Hr,)dà 


= Ae Í (Hy, — Hy dA. 


Integration results in the Stefan- 
Boltzmann law, 
Huns, = Ae (TA — Te), 
e = 5.67 (10-2) wen Ka, 


Then: 
Ae = Henresn/Lo (TH on 1,5) ]. 


The solid lines in Figure 1 show the 
effects of skin temperature, 7,, on 
the likely upper and lower limits of 
Ae, corresponding to the upper and 
lower estimates of threshold, Hen; 
7.0(10-*) and 1.5(10-:) g-cal sec^! 
cm-^?, respectively. Here, temper- 
ature of the test objects, 7%, is 72°F. 
The upper limiting curve shows that 
skin at body temperature should 
mediate discriminations of objects 
differing by .32 in their infrared 
emissivity. When skin temperature 
drops to 83°F, the emissivity of 
different objects may have to differ 
by .8 or more to be discriminated. 
This is consistent with the reports 
that subjects cannot perform these 
discriminations when their fingers are 
cold (Youtz, 1964). On the other 
hand, if the threshold is closer to the 
lower limit, very fine discriminations 
of objects differing in emissivity only 
by about .10 should be Possible for a 
large part of the normal range of skin 
temperature (down to 90°F), and 
even down to 76°F, only 4°F above 
the temperature of the test objects, 
differences of .50 in emissivity would 
be detectable. 

The broken lines in Figure 1 show 
the analogous curves when temper- 
ature of the test objects, T, varies, 
and skin temperature, T,, is held at 
98.6°F. It is obvious from these 
curves that the discrimination would 
be more difficult, for example, on a 
warm summer day in a room lacking 
air conditioning. It is even possible 
that this accounts for some of the 
difficulty experienced by Youtz's ex- 
ceptional subject in the last series of 
experiments, done in the summer of 
1964. 

The discussion so far has concerned 
only discrimination of objects, but to 
a limited extent identification of their 
color might also be possible. Red 
objects have low spectral emissivities 


A 


— we 
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in the longer wavelengths of the 
visible spectrum, but blue objects 
have high emissivities in these wave- 
lengths. Consequently, red sub- 
stances are more likely to have low 
emissivities in the nearby infrared 
than are blue substances. Yellows 
would occupy an intermediate position 
in this respect. Examples of such 
correlations can be found in tables on 
transmission of colored glasses (Hodg- 
man, Weast, & Selby, 1960). Thus, 
if the subject were set to interpret 
differences in induced skin temper- 
ature in terms of hue, and if reference 
stimuli were available, limited but 
statistically reliable identifications 
should be possible, mostly those at 
the ends of the visual spectrum, in the 
reds and blues. This is precisely the 
nature of the identifications reported 
(Youtz, 1964). That is, the subjects 
reportedly made limited but sta- 
tistically reliable identifications of 
reds and blues only. It is also inter- 
esting in this connection that people 
usually consider red a "warm" color, 
and blue a “cool” color, which is in 
the direction consistent with this 
analysis. 


DEMONSTRATION 


'The effect discussed above is great 
enough that one can make the simple 
preparations necessary and test it for 
himself in 1 or 2 hours? The idea 
behind this demonstration is to maxi- 
mize the effects described above in 
order to establish the effectiveness of 
the principle involved. After the 
effect is revealed under the optimum 
conditions, determining the limits of 
the effect and determining such things 
as the variability between individ- 
uals are straightforward, research 
questions. 

? [n fact, it would make a good experiment 
or demonstration for an elementary psy- 
chology laboratory. 


Materials 


Two surfaces are required, one approaching 
an ideal radiator, and the other approaching a 
perfectreflector. Polished, unoxidized metals, 
having emissivities of well under .10 (Hodg- 
man et al., 1960) can be used as the reflector. 
A first-surface mirror is also suitable, but the 
ordinary second-surface mirror is not, because 
infrared rays are absorbed by the glass in the 
pathway to and from the reflecting suríace. 
‘A reflector in the form of a fairly thick (1 inch 
or so), 6-inch square plate is especially 
convenient, as the radiator can be formed 
simply by painting half of it flat black, which 
gives that half an emissivity of .90-.95 
(Hodgman et al., 1960). Next a frame is 
required that is thicker than the square test 
plate by about } inch; it should fit loosely 
around the test plate, so that the plate can 
easily be inserted into the frame and removed 
from it. The last requirement is a table with 
an opaque screen at one end, under which one 
can thrust his arms without being able to see 
the table. An opaque laboratory apron (some 
are available with optical densities exceeding 
10), tied about the neck and thrown back 
over the head, though uncomfortable, can 
serve in place of the opaque screen. A com- 
pletely dark room (or less than 1077 foot- 
candles illuminance) is an added luxury. If 
there is reason to suspect a subject of trickery, 
an aluminum box of the type Gardner (1966) 
recommends can be used. 


Procedure 


The first step is to place the plate inside 
the frame without the subject’s knowing on 
which side the painted half is located. If the 
plate is sufficiently uniform, one can both 
serve as subject and position the plate after 
first spinning it on the table, being careful not 
to receive tactile cues by touching the half- 
painted top surface. It is better, of course, to 
have someone else position the plate, perhaps 
using a set of random numbers. Chances of 
suggestion, or even of the possibility of 
telepathy, can be reduced by having the 
other person, who does not know its orienta- 
tion until he checks it after the subject has 
made his choice, spin the square. A me- 

f randomizing the plate 


chanical means of r 
orientation that provides no extraneous cues 
would be best of all, but the above techniques 


are adequate to demonstrate the point here. 
The subject makes his choice by resting the 
ball of his hand on one side of the frame and 
the tips of his fingers on. the other side; this 
permits him to hold his hand about 4 inch 


288 
125 
c^. . 
= o 
g 100 : on 
8 E " $ E 
Sors "C Ey 2 2 
C Aai e * ț a m 
E E 0 s ° i 
" CE . T 
Eoso|-" a x 
8 |. o 
Ex 
E] * HAND HELD OVER RADIATOR 
H O25 75 © HAND HELD OVER REFLECTOR 
L L L L 
os E 4 6 8 10 


TIME (MIN) 


Fic. 2. Skin temperature changes as a 
function of time juxtaposed alternately to a 
reflecting surface and to an absorbing surface. 
(See text.) 


above the test plate without touching it. 
Although it is not necessary, a simple de- 
tector can be attached so that a trial can be 
stopped before the subject has made his 
choice if he happens to touch the test plate. 
This is done simply by putting 1.5 volts 
across a bridge and monitoring the bridge 
balance with an oscilloscope. The skin and 
the test plate are wired so that any momen- 
tary contact between the skin and either half 
of the test plate shorts a very large resister 
in the bridge, throwing it off balance. As 
the subject moves his hand back and forth, 
from side to side, leaving it in place a few 
seconds each time, he notices a feeling of 
warmth on one side (some subjects feel a 
coolness, or “draft” on the other side); if 
he identifies the warm side as the unpainted 
half (or the side where he notices the draft 
as the painted half), he will be correct about 
90% of the time. The discrimination is 
slightly easier if the number of plate orienta- 
tions is limited to two, but the Probability of 
explanations based on chance rapidly be- 
comes infinitesimal when four positions are 
permitted. 


TYPICAL REsULTS 


All of the options mentioned above, 
except for mechanical orientation, 
have been tried in 8 sets of 10 trials, 
using six subjects, one of whom was 
totally blind. Three of the subjects 
were tested while wearing an alumi- 
num box of the type described by 
Gardner (1966). Each of the 8 sets 
individually attained statistical sig- 
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nificance (binomial test, .05 level); 
the lowest score was 8 correct out of 
10 trials, but because the plate had 
been given four possible orientations 
in that particular set, it was still 
significant (.001 level). 


Temperature Measurements 


The skin temperature of two sub- 
jects was measured with a thermo- 
couple embedded in the palms of 
their hands, as their hands moved 
back and forth above the two halves 
of a 6-inch square first-surface mirror, 
half painted flat black. Precise 
measurement of skin temperature by 
this means is difficult (Barnes, 1963; 
Davis, 1961, 1963; Hardy, 1934a ; 
Hardy & Stoll, 1954), but this tech- 
nique is adequate for the purposes 
here. 

Representative results are shown in 
Figure 2. The data points represent 
readings from a differential voltmeter, 
later converted to degrees centigrade 
from calibration curves. The ordinate 
shows changes in temperature rather 
than absolute temperature because 
most of the errors inherent in this 
technique appear as constant errors 
affecting measurements of absolute 
temperature, while relative changes 
in temperature are much more 
curately reflected (Hardy, 
Molner & Rosenbaum, 1961). 

The figure shows that temperature 
varied over approximately °C, in 
synchrony with the movement of the 
hand from one half of the plate to the 
other half. The discrimination de- 
pends not so much upon the magni- 
tude of temperature differences at- 
tained, however, as upon the rate of 
these changes (Hendler & Hardy, 
1960; Hensel, 1950; Lele et al., 1954), 
as reflected in the slope of the curve. 

he maximum slopes typical of the 
transition from one temperature to 
another as the hand moved from 


ac- 
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above one surface to above the other 
were .017?C sec-!, well above thresh- 
old under these conditions (Hendler 
& Hardy, 1960; Hensel, 1950). 

The fact that the temperature of 
the hand stayed well above the tem- 
perature it was at the beginning of the 
session (represented by the origin of 
the curve) shows that the equilibrium 
point of the hand is higher when its 
thermal exchanges with the ambient 
must pass through the test plate and 
its holder than when these exchanges 
are more direct, as one would expect. 


OTHER VARIABLES 


. The effects of convection, conduc- 
tion, and the cumulative effects of 
thermal exchange have been largelv 
ignored in the discussion so far. 


Convection 


Convection is minimal in the dem- 
onstration described above, as the air 
is almost completely enclosed in a 
small compartment, and the thermal 
gradient produced within the air 
space between the skin and the test 
objects produces a density gradient 
that tends to maintain the status quo. 
Convection was prevented in Youtz's 
experiments by interposing between 
the skin and the test objects a layer 
of "transparent" plastic instead of air. 


Conduction 


The thermal conductivity of air is 
.6 X 10-4 g-cal cm sec? cm-??C-! 
(Hodgman et al., 1960), and in the 
demonstration here, the hand is held 
about 1 centimeter from the suríace 
of the test object. If the skin were 
98.6?F and if the test object were 
72°F, a difference of 15°C, then the 
air would conduct 9 X 10-4 g-cal sec-t 
cm~. Air conduction of heat from 
the skin, then, is a linear function of 
the temperature difference between 
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the skin and the test object, and it is 
in the order of magnitude of the 
absolute threshold. 

The layer of paint covering half of 
the test plate, having a conductivity 
perhaps 1% that of the metal (Hodg- 
man et al., 1960) may appear to be a 
complication in assessing the effects of 
thermal conduction to the test plate. 
One might even wonder whether the 
insulating effect of the paint might 
account lor the discrimination rather 
than the radiant effects. The answer 
is that it does not, because the effect 
is in the wrong direction: The paint 
may retard, to some extent, the rate 
of thermal conduction when the hand 
is poised above the painted half of the 
test plate; the skin would therefore 
lose less heat when above that side 
and would feel warmer there. This is 
exactly opposite to the findings. The 
reason the paint is an ineffective 
insulator in this case is because it is so 
thin, and its conductivity is so much 
greater than that of the air that the 
heat conducted to it by the air is 
rapidly passed to the metal to which 
it adheres. 

Thermal conduction in Youtz's ex- 
periment would have been primarily 
through a medium of plastic rather 
than air. The plastic, being a solid, 
like the paint, would have had a 
thermal conductivity more than 20 
times that of air (Hodgman et al., 
1960), and the plastic layer is not 
likely to have been as thick as 1 centi- 
meter; consequently, thermal con- 
duction would have had a much 
greater effect in his experiments than 
in the demonstration here. How 
much greater depends upon a number 
of things, among which is in the area 
of contact between the subjects’ skin 
and the plastic covering. , This anal- 
ysis leads to the paradoxical conclu- 
sion that, the harder the subject 
pushed down on the test objects, the 
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less he would have been able to 
discriminate between them. 


Cumulative Effects 


The specific heat, density, con- 
ductivity, and spatial dimensions of 
a body, along with its thermal inter- 
actions with other parts of its en- 
virons, determine the cumulative 
effects of thermal exchange with any 
other body. The test object in the 
demonstration is an effective heat 
sink because heat is rapidly distrib- 
uted as it is received, and because its 
size, density, and specific heat permit 
it to absorb a lot of heat before its 
temperature rises greatly. Finally, 
because of its large surface area in 
comparison with its volume, the test 
plate rapidly loses heat to its sur- 
roundings by conduction and radia- 
tion, both of which are proportional 
to surface area. 

On the other hand, a test object 
lacking these properties would behave 
like a series capacitor in an electrical 
circuit; it would pass only transients, 
or changes in heat flow. Thus, a 
subject would have only a short time 
in order to make his judgment before 
the object warmed up to a temper- 
ature close to that of his skin, where 
discrimination is poor (see Figure 1). 

Moreover, because of the higher 
rate of heat exchange between the 
skin and the more emissive surface, 
the temperature of that surface would 
tend to rise more rapidly, and to 
attain higher values, than that of the 
more reflective surface. (Conversely, 

after the hand had been removed, the 
temperature of the more emissive 
surface would tend toward that of the 
ambient more rapidly.) As the tem- 
perature of the emissive surface rose 
an increasing distance above that of 
the reflective surface, the difference 
between the two corresponding rates 
of heat exchange with the skin would 


first diminish, next drop below thresh- 
old, then reverse, and finally rise 
above threshold in the opposite direc- 
tion. Thus, if the temperature of the 
more emissive surface rose a sufficient 
distance above that of the more 
reflective surface, the skin would lose 
more heat to the more reflective sur- 
face than to the more emissive one, 
and the phenomenal effect would be 
reversed: the more emissive surface 
would feel warmer. The relevant 
variables were not sufficiently well 
controlled in the demonstration to 
warrant an analysis of the conditions 
under which this might occur, but the 
advantage of combining the test ob- 
jects in a single piece of highly con- 
ductive material is not only that it 
provides a good heat Sink, but also 
that it maintains the two surfaces at 
a more nearly equal temperature than 
would otherwise be Possible without 
thermoregulatory equipment, 

In the demonstration, the vascular 
System provides the ultimate heat 
Source. The cutaneous thermal re- 
ceptor mechanisms lying between the 
blood supply and the Skin surface are 
complicated and controversial, and 
Since the conclusions of this paper 
depend Primarily upon stimulus-re- 
sponse relationships, these mecha- 
nisms will not be discussed. Readers 
are referred instead to three review 
papers for some of the different views 
receiving current attention (Nafe & 
Kenshalo, 1962; Weddell & Miller, 
1962; Zotterman, 1959). 


Optimum Conditions 


The following list summarizes the 
conditions facilitating the discrimina- 
tion of objects on the basis of their 
radiant properties : (a) The difference 
in spectral emissivities of the test 
Objects, in the wavelengths from 4 u 
to 20 u, should be maximum; (b) the 
temperature difference between the 
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test objects and the skin should be 
maximum, but the test objects them- 
selves should be at the same tem- 
perature; (c) the test objects should 
be good heat sinks, but they should 
not differ in this respect; (d) conduc- 
tion and convection of heat between 
the skin and the test objects should 
be minimum; (e) the medium between 
the skin and the test objects should 
pass radiation of 4,4 to 20 4 wave- 
lengths; (f) the test objects should be 
slightly larger than the size of the 
hand, to take full advantage of 
"spatial summation" (Hardy & Op- 
pel, 1937; Lele et al., 1954). 


Suboptimum Conditions 


A thermal basis for cutaneous dis- 
crimination between the test objects 
can be provided by differences in any 
of the six variables implicated in 
the preceding paragraph. Emissivity, 
however, is the most fundamental and 
perhaps the least obvious variable. 
Differences in more than one variable 
confound the effects of these variables 
in complicated ways that can produce 
opposite results under conditions that 
differ only subtly (eg. see the 
section on Cumulative Effects). 

Illumination of the test objects can 
also complicate the results. In gen- 
eral, illumination hinders the dis- 
crimination, both by augmenting the 
temperature rise of the test objects 
toward. that of the skin, where 
discrimination is difficult (see Figure 
1), and by increasing the temperature 
difference between the test objects 
(for discussion, see section on Cumu- 
lative Effects). Other complications 
follow from the different emission 
spectra of the skin and of the light 
sources. 

Although  well-illuminated test 
objects can rise above skin temper- 
ature, a rather high level of illumina- 
tion is required. For example, a 
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100-watt incandescent source must be 
brought within 1 foot of a surface to 
irradiate it at a rate comparable with 
that produced by the juxtaposed skin 
at body temperature. If the tem- 
perature of the test objects did rise 
above that of the skin, the surface 
that had at first cooled the skin more 
rapidly and that had felt cooler (the 
more emissive surface) would after- 
wards heat the skin more rapidly and 
would feel warmer. Thus, a series of 
correct judgments could appear con- 
tradictory and inconsistent if the 
temperature of the test objects fluctu- 
ated above and below that of the 
skin, crossing threshold on each side. 
Any differences between the temper- 
atures of the test objects induced by 
illumination would tend to bias the 
response in favor of warmer judgments 
for the emissive side. 1 

A number of recent experiments on 
cutaneous discrimination between dif- 
ferently colored objects (Barrett & 
Rice-Evans, 1964; Buckhout, 1965; 
French, 1965; Youtz, 1966) have 
treated illumination as an important 
variable. The fact that the subjects 
were permitted to touch the. test 
objects directly (except for the experi- 
ment of Barrett & Rice-Evans) sug- 
gests that the mechanism under 
investigation was the capacity of the 
test objects to store radiant heat for 
later transference to the skin by 
conduction. The interpretation of 
these experiments in most cases was 
complicated by: the effects of the 
shadow of the hand, the reflection of 
light to the skin by the test objects, 
heat conduction through an unknown 
area and duration of contact between 
the skin and the test objects, tem- 
poral changes in these variables ac- 
companying hand movements, and 
the omission of information on irradi- 
ance and on the thermal properties of 


the test objects. 
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NOMENCLATURE 


A number of names have been used 
to describe skin-mediated discrimina- 
tions of the type normally requiring 
vision. Some reference toskin is 
appropriate to denote the "sensitive 
organ and to distinguish? between 
cutaneous and retinal functions. Ref- 
erence to any particular skin layer, 
however, such as the dermis, is unduly 
specific. To differentiate between 
sensitivity of the type treated here 
from that which might imply a greater 
degree of spatial resolution, such 
terms as vision and sight should be 
eschewed for a term like sensitivity. 
Nor is the discrimination herein 
discussed analogous to color vision, 
for the multidimensional color space 
is compressed here into a single con- 
tinuum, rate of heat exchange. But 
rate of heat exchange between ob- 
Server and object is correlated with 
thehue of the Object as well as with its 
lightness, so that a general term that 
subsumes both Properties, such ascolor 
sensitivity, does serve to relate the sen- 
sory function to the visible differences 
between the objects discriminated. 
Because the discrimination actually 
depends upon thermal exchanges that 
are only Statistically correlated with 
visible Properties, however, perhaps 
cutaneous emissivity Sensitivity (or 


€ sensitivity) would be a more accu- 
rately descriptive term. 


IMPLICATIONS 


Interest in cutaneo 
tivity probably derives mainly from 
two sources: practical interest in 
exploiting new sensory abilities, par- 
ticularly as they might benefit the 
blind; and theoretical interest jn 
understanding the senses and the Ways 


in which they respond to Physical 
Conditions. 
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It is difficult to imagine any direct, 
practical application of this sensory 
capacity. Even under optimum con- 
ditions it is a nearly threshold effect. 
There is no evidence even of the 
crudest spatial resolution. Holding 
the hand perpendicular to the bound- 
ary between the reflective and emis- 
sive halves of the test plate provides 
no cues as to which half is which; 
these cues only arise after the hand is 
rotated 90° and placed successively 
over the two halves of the plate. No 
convincing evidence of the more visual 
kind of cutaneous sensitivity is known 
to the writer. 

Thus, the main interest Properly 
centers on theoretical implications. 
The phenomenon reported by Youtz 
would have been most interesting if, 
after having been verified, it had 
pointed to new senso 
to flaws in the understanding of 
previously known The 
demonstration, and its ready replica- 
validity of a 
Ty much like 

Z; the com- 
putations and temperature measure- 
a reasonable 
omenon that 


principles or sensory 
capacities. If this explanation ac- 


counts for all valid findings, then the 
phenomenon is of less interest than 
originally might have been expected, 
Until new explanations are ad- 
vanced, formal experiments in this 
area will be most easily interpreted if 
they provide the information neces- 
sary to determine the applicability of 
the present explanation to the results. 
minimum jn this respect is in- 
formation on the temperatures of the 
skin and of all test objects, and on the 
“Mssivities of the test objects. Ther- 
mal control of the test objects would 
€ of added value. Of further value 
would be information on the spectral 
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absorption curves of the test objects, 
the spectral emission curves of all 
radiant sources, the irradiance from 
all sources as measured on the test 
objects and on the skin, the thermal 
properties of the test objects, and 
perhaps even the results of calori- 
metric measurements. 
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3 loci of tension between behaviorism and phenomenology are dis- 
cussed. Logical analysis indicates that the methods of phenomenology 
as defined are not relevant for the construction and test of psycho- 
logical theory, although they may be valuable in other respects. 
2 rapprochements are discussed in which complementary roles are 
assigned to behaviorism and phenomenology. Also, the problem of 
terminological rapprochement is discussed. 


Only silence before the Thou . . . leaves the 
Thou free. . . . Every response binds up 
the Thou in the world of It. That is the 
melancholy of man, and his greatness. For 
that is how knowledge comes about . .. 
He will, indeed, have to grasp as an object 
that which he has seen with the force of 
presence, he will have to compare it with 
Objects, establish it in its order among 
classes of objects, describe and analyze it 
UEM, umm as It can it enter the 
Structure of knowledge è 
39-40]. ge [Buber, 1958, pp. 


In this paper, methodological dif- 
ferences between  behavioristic and 
phenomenological psychology are dis- 
cussed. The plan of the paper is as 
follows: First, a brief definition of 
behaviorism and phenomenology will 
show how these terms are conceived 
here. Second, an analysis will be made 
of three loci of tension between be- 
tween behaviorism and phenomenol- 


ogy. Third, the question of rapproche- 
ment will be discussed. 
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tion is also supporting ongoing work on the 
relation between science and religion; this 
work suggested some of the ideas contained 
in this paper. 
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BEHAVIORISM AND PHENOMENOLOGY 
DEFINED 
Behaviorism 


Behaviorism is conceived here in 
Bergmann’s (1956) sense of methodo- 
logical behaviorism. According to this 
position, the fundamental data of psy- 
chology are the publicly observable 
behaviors of organisms; these are 
conceived as being describable in an 
intersubjective observation language. 
For the behaviorist, private experi- 
ences per se are not admissable as 
data, but only their publicly observable 
symptoms, if any. Behaviorists differ 
as to whether it is necessary or at 
least advisable to assign to private 
experience any role at all in scientific 
psychology. Behaviorists who assign 
a role to private experiences would do 
so by referring to them by means of 
theoretical (as opposed to observa- 
tional) terms (see Eriksen, 1962; 
Jessor, 1958). 

If a behaviorist wanted to introduce 
into the vocabulary of psychology theo- 
retical terms referring to a person’s 
private experience, it would be neces- 
sary for him to (a) define the terms 
by multiple convergent operations, that 
is, by several distinct operations which 
jointly determine the intended empiri- 
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cal meaning (see Garner, Hake, & 
Eriksen, 1956), (b) develop the theo- 
retical relations between these terms 
and others, and (c) demonstrate the 
value of such terms for the explanation 
of behavior. These three conditions 
are not completely satisfied at present. 


Phenomenology 


Phenomenology is conceived in this 
paper in a sense similar to MacLeod’s 
(1964) interpretation (cf. Edie, 1964, 
for a more general discussion of phe- 
nomenology and its relation to Existen- 
tial philosophy). We shall define pure 
phenomenological psychology (ppp)? 
by its subject matter and by its method 
of investigation. The subject matter 
of ppp is all of experience. It is the 
intent of the ppp to study that which 
is immediately given in experience, 
MacLeod (1964) states in this con- 
nection : 


- colors and sounds are data; so are 
impressions of distance and duration; so are 
feelings of attraction and repulsion; so are 
yearnings and fears, ecstasies and disillusion- 
ments; so are all the relations—ranging 
from the crude and obvious to the delicate 
and intangible—with which the world pre- 
sents us [p. 51]. 


The ppp studies the data of experi- 
ence by a method which involves the 
suspension of implicit and explicit as- 
Sumptions. (This method is reminis- 
cent of what is called by Husserl, in 
a more technical context, “phenomeno- 
logical reduction” and the act of 
“bracketing” —1962, especially pp. 10, 
12, 96-100). That is, the Ppp seeks 
to investigate experience freed, as far 
as possible, from the encumbrance of 
any preconceived theoretical ideas 
about that which is being investigated, 
This is sometimes referred to by 


2 We shall use ppp to also stand for "pure 
phenomenological psychologist" and rely on 
the context to make clear which meaning is 
intended. 
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Gestalt psychologists as naïve phe- 
nomenology (see Koffka, 1935, p. 73; 
Köhler, 1947). There is an attempt 
to investigate the immediate data given 
to the mind and not a theoretical con- 
struction or interpretation of these 
data. In the words of Lewis: 


There are in our cognitive experience two 
elements; the immediate data such as those 
of sense which are presented or given to the 
mind, and a form, construction, or inter- 
pretation, which represents the activity of 
thought. Recognition of this fact is one of 
the oldest and most universal of philosophic 
insights [1956, p. 38]. 


For a similar distinction made by phe- 
nomenologists between “fully reflexive 
categorical thought” and “prereflexive 
awareness,” see Edie (1964). The 
subject matter of Ppp is the former 
element. 

Carried to its logical extreme, the 
method of ppp demands the bracketing 
of all concepts since they arise out of 
the activity of thought. It is therefore 
appropriate to call the type of experi- 
ence involved in the method of ppp 
nonconceptualized experience, that is, 
an experience of the immediately 
given, sometimes referred to as “brute 
facts” (James, 1910) or “raw feels” 
(Feigl, 1958), not mediated by any 
concepts. 

By characterizing Ppp in this way, it 
is not implied that it is ever possible 
to have a purely nonconceptualized 
experience. But the method of ppp 
implies that one can approximate such 
an experience. 


THREE Loci or TENSION BETWEEN 
PPP AND BEHAVIORISM 


Construction and Evaluation of 
Psychological Theories 


The first locus of tension involves 
the context of discovery (Reichenbach, 
1938) for psychology, that is, the con- 
siderations that govern the way in 
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which psychological theories and hy- 
potheses are discovered. We deal here 
with what is at least a stylistic differ- 
ence between ppp and behaviorism. If 
we restrict ourselves to the context of 
discovery, there is no reason why a 
behaviorist should not have the option 
of letting his explanatory efforts be 
initially guided by a phenomenological 
analysis of experience. He could then 
continue his investigation by collecting 
behavioral data. On the other hand, 
for the ppp, the former starting point 
is considered indispensable. This styl- 
istic difference can become the basis 
for the evaluation of theories and in 
this sense be extended into the context 
of justification (Reichenbach, 1938). 
Behavioristic theories which fail to 
represent some aspect of experience 
discovered by the method of ppp have 
been criticized by the latter as inade- 
quate (see MacLeod, 1964). 

Logical analysis of nonconceptualised 
experience. In order to evaluate such 
criticism, we shall present a logical 
analysis of nonconceptualized experi- 
ence. This analysis is influenced in 
part by the writings of philosophers 
about what we call nonconceptualized 
experience (see Ayer, 1956; Black, 
1954; Carnap, 1928, 1963; Lewis, 
1946, 1956; Northrop, 1948, 1962). 
Also relevant are discussions of mysti- 
cism which can be considered a type 
of nonconceptualized experience (see 
T. 1962; Stace, 1960; Zaehner, 

What type of knowledge (the term 
knowledge is here understood in its 
usual conceptual Sense) can be gained 
by an experiencer who has nonconcep- 
tualized experiences and who makes no 
attempt to introduce any hypothetical 
or theoretical construction into his dis- 
cussion of his experiences? It is our 
contention that a person who restricts 
himself in such a way cannot have any 
knowledge at all. 


In order for an experiencer to be 
able to claim that his experience has 
led to some sort of knowledge, he has 
to be able to give a description of the 
content of his experience for himself 
or others. Strictly speaking, any state- 
ment about a nonconceptualized ex- 
perience involves an interpretation; 
this carries a conceptual element into 
the description. Let us examine more 
closely the types of inferences which 
are involved in such statements. Con- 
sider the statement “I saw red,” or 
perhaps, “Redness was experienced by 
me,” as an immediately given. We 
normally infer from such a statement 
that the experience of redness was not 
merely a figment of the experiencer's 
imagination. In addition, the state- 
ment is usually considered to imply 
that other persons, or the same person 
on another occasion, would experience 
redness if placed in the same situation. 
It is reasonably clear that such infer- 
ences would be logically excluded if 
one were restricted to reporting only 
what is immediately given in noncon- 
ceptualized experience. But can such 
a reporter legitimately affirm that he 
experienced redness? In order to as- 
sign properties such as red to an ex- 
perience, it would be necessary to ac- 
cept the division of the color continuum 
into the particular set of categories 
used in our everyday language. This 
classification carries a conceptual ele- 
ment into the description of the non- 
conceptualized experience (see Brown, 
1965). 

In general, whoever attempts to de- 
scribe a nonconceptualized experience 
can do so only by a process of con- 
ceptualization which distorts and im- 
poverishes the experience. Therefore, 
an experiencer who attempts to de- 
scribe his nonconceptualized experi- 
ence without such loss of content must 
in fact be speechless. In a similar 
vein, Black (1954) asserts that state- 
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ments about the immediately given are 
not to be considered as descriptions 
but only as exclamations, as it were. 
Of course, a person can make a state- 
ment about his nonconceptualized ex- 
periences, but such statements are 
logically incommensurable with the ex- 
periences per se and can at best only 
be correlated with them, because state- 
ments per se involve the use of class 
concepts, whereas the idea of noncon- 
ceptualized experiences logically ex- 
cludes the applicabiilty of class con- 
cepts. Northrop (1962, especially pp. 
154, 237, 238) calls the relation be- 
tween such incommensurables “epis- 
temic correlation" (see also Havens, 
1961). 

The discussion of nonconceptualized 
experiences will now be summarized 
and partially extended by the presen- 
tation of a list of characteristics, each 
of which is logically implied by the 
meaning of nonconceptualized experi- 
ences. The characteristics assigned to 
Such experiences may not be independ- 
ent of each other. 


l. During a nonconceptualized ex- 
perience, the experiencer is not capable 
of distinguishing himself conceptually 
as experiencer from that which he is 
experiencing and is in this sense in 
union with it, 

2. During a nonconceptualized ex- 
perience, the experiencer is not capable 
of assigning a locus to his experience 
in a public spatio-temporal coordinate 
system. Thus, the experience could 

z only have a locus in a private frame 
of reference with respect to space and 
time. With respect to time, the ex- 
perience is called a specious present 
(see James, 1890, Ch. 18). 

3. Nonconceptualized experiences 
are ineffable in the sense that during 
and after the nonconceptual 
ence the experiencer has 
disposal words 


ized experi- 
not at his 
which could be used 
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by him to describe for himself or others 
the content of his experience. A for- 
tiori, they are propertyless, since prop- 
erties involving class terminology can- 
not be assigned to them. This implies 
that differences, if any, contained in 
the experience cannot be described ; it 
remains in this sense an undifferenti- 
ated unity. 

4. Nonconceptualized experiences 
are solipsistic. Since they cannot be 
verbally communicated to others, they 
have meaning, if any, only for the 
experiencer himself, 


Two conclusions about the method 
of ppp. First, the advocates of ppp have 
criticized the method of analytic intro- 
spection, which had been used by the 
Structuralists, on the ground that their 
method was not a true investigation of 
experience, since the search for ele- 
ments of experience leads to the intro- 
duction of elements where, in fact, the 
immediately givens are wholes, Thus, 
the method of investigation partially 
dictates its outcome (see Kóhler, 1947 ; 
MacLeod, 1964). A similar argument 
can be made with respect to ppp: Its 
method emphasizes personal as dis- 
tinguished from public time (see May, 
1958, pp. 65-71). This is understand- 
able as a logical consequence of its 
method (see Characteristic 2 above): 
In general, it will sometimes be diffi- 
cult to know whether ppp has chosen 
a certain theme only as a logical con- 
Sequence of its method of investigation, 
or in its own right because of its im- 
portance as an essential aspect of hu- 
man experience, or as a result © 
confounding both elements. 

Thus, paradoxically, the attempt t° 
bracket all concepts in order not tO 
bias the results of an investigation O 


à : cim c 
experience introduces certain episte™ 


. E, y c 
Consequences which tend to influenc 


: . : Q 
the results of an investigation a" 
creates a bias. 


= 
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The second conclusion suggested by 
this analysis of nonconceptualized ex- 
periences is that these experiences do 
not provide the basis for the construc- 
tion of any conceptual system. This 
conclusion follows directly from the in- 
effability of these experiences. There- 
fore, mnonconceptualized experiences 
cannot be used as a basis for the 
criticism of any conceptual system. It 
is in this sense that we understand 
MacLeod’s (1964) statement that ppp 
is only propaedeutic to the develop- 
ment of psychological theory. 


Differences in Types of Explanation 


The second locus of tension to be 
discussed here concerns the possibility 
of explanation in behaviorism and ppp. 
Explanation in science is generally 
thought to involve the use of some 
theory (Hempel & Oppenheim, 1948). 
Thus, strictly speaking, such a process 
of explanation is not compatible with 
the method of ppp. 

Two possible alternatives can be 
adopted by the ppp: First, he can re- 
fuse to construct any explanatory sys- 
tem and rely rather on some kind of 
empathic or intuitive understanding 
which cannot be verbally communi- 
cated at all. Some Existential psycho- 
analysts have apparently used this 
approach. For example, Tymieniecka 
(1962, p. 69) mentions three fun- 
damental "empirical" statements ad- 
vanced by Jaspers. These are: 

1. Man in his entirety cannot be objec- 
tively (conceptually) known; 

2. Cognition is not limited to causal ex- 
planation ; 

3. There are essential aspects of mind 
which are by their very nature not accessible 


to experimental methods and cannot be 
rationally grasped. 


Second, the ppp can at one time take 
the approach mentioned above. At an- 
other time, he can transcend the phe- 
nomenal world and construct a system 
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which is conceptual and which is be- 
lieved to preserve some of the intuitive 
insights obtained in nonconceptualized 
experience. But in such a case, ppp 
has only been a heuristic device limited 
to the context of discovery. It is not 
clear how the insights obtained by 
these methods can be carried over into 
the actual construction of theories. 


Methods of Test 


The third locus of tension concerns 
the method of test. For the behavior- 
ist, any psychological theory, law, or 
statement must be ultimately testable 
with reference to intersubjectively ob- 
servable symptoms of phenomena. The 
advocates of ppp, in so far as they 
do not exclude any theoretical state- 
ments, may differ as to their agreement 
with this demand of the behaviorist. 
For example, Rogers (1964) would 
agree with this demand. But whoever 
accepts this demand becomes in effect 
a methodological behaviorist, since his 
conceptual system rests ultimately on 
a behavioristic base, and ppp is used 
only in the context of discovery. Other 
advocates of ppp would reject any 
such insistence. 

What role, if any, can be assigned 
to nonconceptualized experiences 1m 
the testing process? At one time or 
another, several philosophers have sug- 
gested that nonconceptualized experi- 
ences provide the basis for testing all 
scientific statements (see Carnap, 1928, 
p. 273; Lewis, 1946, 1956; Russell, 
1912, Ch. V; Schlick, 1934). 

To our knowledge, no one has ever 
presented in sufficient detail a logical 
analysis of such a testing process. We 
shall discuss an approach by Schlick 
(1934) because he comes very close 
to this desideratum: He takes his 
point of departure from the idea that 
the test of a scientific statement always 
ultimately involves the derivation of 
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some statement describing some ob- 
servable phenomenon. Schlick, in 
common with other philosophers of 
science, calls the sentences in which 
the results of observations of this kind 
are reported "protocol sentences." 
Such a sentence may for example have 
the form: “John Doe perceived at that 
and that time, at that and that place, 
this and that." Protocol sentences may 
thus provide a basis for testing scien- 
tific statements, but not an absolutely 
certain one. They are hypothetical; it 
is conceivable that despite great care 
the sentence written down by an ob- 
server in the laboratory protocol is 
false. 

Schlick, however, argues that there 
is another kind of sentence which pro- 
vides a more suitable basis for the test 
of scientific sentences than protocol 
sentences. He calls them “Konstatier- 
ungen" and characterizes them as fol- 
lows: They are descriptions of what 
is immediately experienced and always 
have the demonstrative form “here 
now this" for example, "here now 
blue," without any reference to an 
experiencer, Konstatierungen can be 
known with absolute certainty, though 
only in the fleeting moment of the 
occurrence of the experience mentioned 
in the Konstatierung. In addition, Kon- 
statierungen and the certainty which 
attaches to them are private. A Kon- 
statierung cannot be written down nor 
be communicated in words to others 
because, for lack of general definitions, 
the meaning of the words "here," 
"now," "this" can only be indicated 
by demonstrative gestures, like point- 
ing, and not in words. 

Thus, Schlick's Konstatierung can 
be understood as a description of a 
nonconceptualized experience by an 
experiencer during the moment of his 
experience. We shall revert presently 
to the difficulties of assigning any role 
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to Konstatierungen in the testing proc- 
ess of scientific sentences. (For criti- 
cal comments about Schlick’s paper, 
see Ayer 1953, especially pp. 10, 11, 
90, 91; Nagel, 1954, p. 219.) 

Despite the importance Schlick as- 
signs to Konstatierungen in the testing 
process, they cannot, for him, be the 
basis for the construction of theories, 
because Konstatierungen are only mo- 
mentary and have passed when one 
starts the construction. 

A position similar to Schlick’s, as to 
the role of nonconceptualized experi- 
ences in the testing process, has been 
taken by Lewis. He states that “with- 
out . . . apprehensions of direct and 
indubitable content of experience, there 
could be no basis for any empirical 
judgment, and no verification of one 
[1946, p. 182]." In his book, Mind 
and the World Order, Lewis (1956, 
Chs. 2-5) discusses this in detail. He 
calls the qualitative characteristics of 
an immediately given "qualia." They 
have no names as distinguished from 
the properties of objects. For Lewis, 
absorption in the immediately given 
transcends conceptual thinking alto- 
gether, whereas knowledge of objects 
involves conceptual interpretation. 
Though the experience of a quale is 
ineffable, Lewis (1956, p. 125) takes 
as an example the statement, “this is 
the same ineffable ‘yellow’ that I saw 
yesterday.” Speaking in this way 
about the “ineffable yellow” seems like 
a contradiction in terms, and, inter- 
preting it as the same as yesterday, 
contradicts, in our opinion, character- 
istics which Lewis assigns, as we 
do, to nonconceptualized experiences, 
namely, that they are not only ineffa- 
ble, but propertyless and private as 
well. 

Unless these contradictions are re- 
solved, we do not see how Lewis is 
entitled to state that “without the cor- 
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relation of concept and qualia no ex- 
perience could verify or fail to verify 
anything [p. 144].” 

The treatment of the testing process 
presented by both Schlick and Lewis 
is not compatible with the logical 
analysis presented in this paper: First, 
the question of the certainty of non- 
conceptualized experiences is a contro- 
versial one in philosophy (see Allport, 
1960, p. 139; Ayer, 1956; Black, 1954; 
Born, 1924, pp. 4-5; Goodman, 1952; 
Tymieniecka, 1962, pp. 3-16; Weyl, 
1949, p. 116). In addition, whatever 
position one takes about certainty, it 
is clear that in order to test a scientific 
Statement by a nonconceptualized ex- 
perience, or by a Konstatierung which 
purports to be a description of such 
an experience, the experiencer would 
have to abstract with reference to some 
relevant property of the experience. 
Otherwise, he could not know whether 
or not the content of his experience 
confirmed or disconfirmed the state- 
ment. Let us consider an example: 
Suppose an experiencer were to test 
a sentence referring to the occurrence 
of a coincidence, for example, between 
the pointer of an instrument and a 
mark on a scale. The experiencer 
might then have an experience which 
he might reasonably conceptualize, at 
a later time, as a coincidence. How- 
ever, during the moment of his non- 
conceptualized experience, he cannot 
know whether what he is experiencing 
18 or is not what, in the sentence, is 
meant by a coincidence, unless he 
makes a conceptual interpretation of 
his experience. If the experiencer, 
during his nonconceptualized experi- 
ence, could know whether what he is 
experiencing is a coincidence, then he 
should be able to communicate his 
experience to others. Conversely, that 
which cannot be described to others 
cannot be described to oneself in terms 


which are clearly understandable to 
others. We do not see how Lewis or 
Schlick could overcome this objection. 
Thus, on the view presented here, non- 
conceptualized experiences per se can- 
not be used to test scientific statements. 

As mentioned previously, ppp can- 
not be used as a basis for the construc- 
tion of theories, nor, as we see now, 
for the purpose of testing any scientific 
statement, and ppp cannot even be 
used as a means of describing experi- 
ence. Thus, on the view presented 
here, ppp is not relevant per se for 
scientific psychology. However, the 
analysis may have some relevance for 
understanding other important aspects 
of ppp. 

As mentioned above, any concep- 
tualization of nonconceptualized expe- 
rience can lead to a feeling that the 
conceptual statement has impoverished 
and distorted the experience. The 
feeling can be particularly strong, for 
example, with respect to aesthetic ex- 
periences, where we may often feel the 
sheer inability of language to capture 
the subtlety of such an experience (see 
Schachtel, 1951). This feeling of im- 
poverishment and distortion may be 
accompanied by a feeling of disunion 
between, or separation from, experi- 
encer and that which is experienced. 
These feelings can become particularly 
acute when the object of our concern 
is another human being. Conceptuali- 
zation, by assigning properties to a 
person, is sometimes even considered 
as a dehumanization. Such concerns 
are prevalent among Existential psy- 
choanalysts and explain in part their 
rejection of Freud (see May, Angel, & 
Ellenberger, 1958). 

We have quoted Buber at the begin- 
ning of this paper because it seems to 
us that he has clearly understood the 
advantages and disadvantages inherent 
in the act of conceptualization. Ex- 
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pressed in Buber’s (1958) terminol- 
ogy, an experiencer has the following 
choice between two modes of relation: 
Either he may choose to have, in a 
receptive moment, a direct experience 
of another human being, a Thou, with- 
out separation by intervening concepts. 
But there can be no knowledge in the 
I-Thow relation. Or, he may choose 
to have knowledge, to describe, and 
analyze objectively. But in this J-/¢ 
relation, man experiences another hu- 
man being in separation from him. 

A position similar to Buber’s has 
been taken by Tillich (1960). He 
argues, in effect, that certain noncon- 
ceptualized experiences though not 
valuable for science at all are valuable 
in another way, namely, as a basis for 
religion. Similarly, on allegedly em- 
pirical grounds, Maslow (1962) has 
argued that peak experiences (a form 
of nonconceptualized experience) are 
valuable in that they promote psycho- 
logical growth and health. What is 
maintained here is that any advantages 
inherent in nonconceptualized experi- 
ences are obtained at the price of their 
irrelevance per se for science. Also, 
the view presented here should not be 
interpreted as implying that one should 
not or cannot take nonconceptualized 
experiences as an object of scientific 
investigation. For example, the study 
of mysticism represents an interesting 
and legitimate scientific endeavor. 


QUESTION or RAPPROCHEMENT 


Despite the fact that the analysis 
presented here does not permit any 
role assignment to ppp in scientific 
psychology, it is of some interest to 
examine the difficulties confronted by 
various approaches to rapprochement 
which would be faced by those psy- 
chologists who would not entirely ac- 
cept the logical analysis presented here. 


Complementary Roles for ppp and 
Behaviorism 


A possible basis for rapprochement be- 
tween ppp and behaviorism can be 
developed by assigning to them com- 
plementary roles in a unified psychol- 
ogy. We will examine two such pos- 
sible bases for rapprochement. 

A first basis would be afforded by 
the following two principles: 


l(a). Only the methods of the be- 
haviorist (and not those of the ppp) 
can be used to construct theoretical 
systems. 

1(b). Only the methods of the ppp 
(and not those of the behaviorist) can 
be used to test theories, (Of course, 
lb is contrary to the position taken 
in this paper.) 


To our knowledge, such a basis for 
rapprochement has never been sug- 
gested by any psychologist. It is com- 
patible with the position of such phi- 
losophers as Schlick and Lewis, Tt is 
not at all clear that the ppp would 
be satisfied with this abstention from 
theorizing as suggested by la above. 
As for condition 1b, most behaviorists 
would probably find this rather contro- 
versial The behaviorists’ insistence 
on an intersubjectively observable base 
for testing statements and their de- 
mand for observations which result in 
a high degree of intersubjective agree- 
ment tend to restrict the class of state- 
ments which could be directly tested 
to protocol sentences about publicly 
observable symptoms. No such re- 
striction is implied by a method of test 
based on nonconceptualized experi- 
ences. In general, the class of state- 
ments which can be directly tested 
by nonconceptualized experiences ap- 
pears to be larger than the class of 
statements which refers to experiences 
for which high degrees of intersubjec- 
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tive agreement can be obtained. Thus, 
if one can allegedly experience emo- 
tions as immediately given, then one 
could test directly statements about 
emotions. If one believes the mystics 
and accepts condition 1b, it would be 
possible to have a nonconceptualized 
experience of the divine, thus provid- 
ing a basis for the direct test of state- 
ments about the divine. If noncon- 
ceptualized experiences are used as a 
E of testing statements instead of 
: ersubjective tests by behavioristic 
Symptoms not only is the rational 
an gn of the testing process 
nE but also the class of state- 
ae irectly testable becomes larger. 
ditio: b it is possible to add to con- 
solid b an arbitrary restriction which 
kids Vitiate the behaviorist’s probable 
E E "I to the enlargement of the 
y s o testable statements. Namely, 
pete possible to restrict the use of 
E onceptualized experiences to the 
baa test of those statements which 

e directly tested by behavioristic 
procedures, for example, to protocol 
sentences. If this is done, the class 
of statements which are directly test- 
able is not enlarged. However, the 
addition of such a restriction woul 
violate the intent of the ppp to study 
all of experience and to afford equal 
significance to all aspects of experi- 
ence, Such a restriction would, in 
effect, divide the class of nonconcep- 
tualized experiences into two sub- 
Classes, one of which is assigned the 
role of testing scientific statements, and 
the other is not assigned any such role 
at all. 

A. second basis for rapprochement 
follows below : 

2(a). Only the methods of ppp (and 
not those of the behaviorist) can be 
used to construct theoretical systems. 
(Of course, 2a is contrary to the posi- 
tion taken in this paper.) 
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2(b). Only the methods of the be- 
haviorist (and not those of the ppp) 
can be used to test theoretical systems. 


This is the basis of rapprochement 
suggested by Rogers (1964). The 
principal difficulty in connection with 
this basis for rapprochement involves 
the behaviorist’s possible objection to 
condition 2a. Such a condition appar- 
ently restricts the theorist's ability to 
freely postulate theoretical concepts. 
Einstein (as cited by Holton, 1965 °) 
has expressed this idea as follows: 
Everything conceptualizable is constructive 
and not derivable in a logical manner from 
immediate experience. Therefore we are in 
principle completely free in the choice of 
those fundamental conceptions upon which 
we found our rendition of the world. Every- 
thing depends only on this: to what extent 
our construction is suitable for bringing 
order into the apparent chaos of the world of 


experience. 


Condition 2a implies that the theorist 
is permitted to introduce only those 
concepts which somehow have a 
foundation in what is given. 


Terminological Rap prochement 


Any rapprochement between ppp 
and behaviorism would be dependent 
on agreement on terminology. There 
are surely terms in the vocabularies of 
ppp and behaviorism which could be 
successfully coordinated. Terms re- 
ferring to certain sensory experiences, 
for example, “red,” can be coordinated 
and assigned compatible meanings for 
both ppp and behaviorism. Other 
terms, the theoretical status of which 
has not been extensively developed 
and which are close to ordinary experi- 

3 This quote is an English translation by 
Gerald Holton contained in the paper pre- 
sented by him at the 11th International 

of Science, Warsaw 


f the History 
Congress o! a OE man 


and Krakow, August 1965. 4 r 
text is contained jn Einstein s unpublished 


papers. 
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ences, might also be assigned compati- 
ble meanings, for example, terms re- 
ferring to emotional states, such as 
reverence. However, the fulfillment 
of this condition is dependent on 
the progress of psychological theory. 
As behavioristic psychology develops, 
terms which are used in a relatively 
atheoretical way, for example, “fear,” 
acquire modified meanings by virtue 
of having entered into a network of 
theoretical laws and relations which 
serve to specify the meaning of the 
term. It is difficult to imagine how 
the modified meaning of such terms 
can be directly experienced or coordi- 
nated in any sense with an immediately 
given experience. A direct coordina- 
tion and compatibility of meanings is 
possible only after their reduction to 
directly observable symptoms. An- 
other obstacle to the fulfillment of a 
terminological rapprochement consists 
in the introduction of terms into a be- 
havioristic psychology which have no 
counterpart at all in nonconceptualized 
experience, for example, terms con- 
tained in a highly formalistic mathe- 
matical model. Thus, as behavioristic 
psychology develops, the possibility of 
rapprochement between ppp and be- 
haviorism may grow more remote in 
this respect. On the other hand, it 
can happen that theoretical psychology 
will develop in such a way that there 
will remain many aspects of noncon- 
ceptualized experience which have lit- 
tle counterpart in the terminology of 
behavioristic psychology, for example, 
the experience of disgust is at present 
without any extensively developed 
counterpart in the theoretical language 
of behavioristic psychology (see Mac- 
Leod, 1964). 

There is one way in which future 
developments within behavioristic psy- 
chology might lead to a greater degree 
of terminological agreement between 


ppp and behaviorism. As Zener 
(1958) has pointed out, the insistence 
that science deals only with statements 
about intersubjectively valid observa- 
tions must be qualified if it is to be 
properly understood. This statement 
should not be taken to imply that only 
statements about observations where 
the intersubjective agreement among 
a large class of observers is almost 
perfect should be considered as data 
in science. Since the degree of agree- 
ment with respect to protocol sen- 
tences is never perfect, each psycholo- 
gist is faced with a somewhat arbitrary 
decision about the degree of agreement 
with regard to some observation that 
he will demand. Zener would, for 
example, have the psychologist study 
reports about aesthetic qualities of ex- 
perience which might be based on a 
less than perfect agreement among a 
small group of individuals with refined 
aesthetic sensibilities. To the extent 
to which behavioristic psychology is 
open to reports about subtle qualities 
of experience and is willing to accept 
them as data to be explained, there 
should be an increased possibility of 
terminological agreement between ppp 
and behaviorism, 
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The relative adequacy with which descriptive behaviorism and cogni- 
tive learning theory can account for the findings of verbal conditioning 
experiments was examined along with the methodological and epis- 
temological assumptions on which these interpretations are based. 
Methodological problems associated with the techniques employed for 
the assessment of mediating cognitive states (awareness) were dis- 
cussed, and evidence was presented to support the view that “what is 
learned” in verbal conditioning is awareness of a correct response- 
reinforcement contingency. The implications of verbal conditioning 
findings for the more general question of learning without awareness 
were considered, and it was concluded that the investigation of aware- 
ness in relation to other behavior is requisite to the development of 


an adequate theory of human learning. 


Operant behavior generally refers to 
responses which an organism in a given 
environment displays spontaneously 
without special training. Operant con- 
ditioning usually implies a variety of 
experimental techniques wherein a sub- 
ject is rewarded after engaging in 
operant behavior of a selected kind. 
The successful conditioning of an 
operant response is inferred from an 
increase in its rate of occurrence as a 
function of reward (reinforcement) 
administered by the experimenter. 
Through the efforts of those who have 
used operant conditioning procedures, 
significant advances in our knowl- 
edge of animal learning have been 
made possible, and widespread interest 
has been generated in the use of such 
procedures for investigating human 
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part by grants to the first author from the 
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man Cliff, Jum C. Nunnally, Henry Slucki, 
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first author is now at the National Institute 
of Mental Health, Bethesda, Maryland. 


learning. The purpose of the present 
paper is to examine theoretical and 
methodological issues which have arisen 
in recent applications of operant con- 
ditioning techniques to verbal learning. 
In verbal operant conditioning, here- 
after termed verbal conditioning, the 
subject is typically instructed to speak 
in accordance with a particular task. 
He is not told that this task involves 
reinforcement or learning. The experi- 
menter attempts to change the rate of 
emission of certain responses by the 
systematic application of a reinforcing 
stimulus, usually some form of social 
reward, such as the experimenter say- 
ing "Mmm-hmm" or "Good." Studies 
of verbal conditioning originated in 
close connection with the concepts and 
procedures developed by Skinner 
(1938) with  infrahuman species. 
Greenspoon, a pioneer in verbal con- 
ditioning research, maintains that 


- it should be possible to work with 
verbal behavior in much the same way that 
experimenters have worked with the be- 
havior of rats, pigeons, etc. It should also 
be possible to investigate the same kinds of 
variables that have been investigated with 
the non-verbal behavior of humans and 
infrahumans [1962, p. 511]. 
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e the Skinnerian (1957, 1963a, 
63b) approach has been clearly ap- 
o both in the methods employed 
s variables investigated in most 
al conditioning experiments. 
Nui with Skinner's emphasis 
ES experimental analysis of be- 
Give a many verbal conditioners 
aa y ignored the possibility that 
Perse awareness of response-rein- 
their Miis a cag might influence 
bt onditioning performance. In- 
A igators who have examined sub- 
e reports in relation to their 
areas Bin : performance have dis- 
b de onsiderable evidence suggest- 
eorna cognitive processes mediate 
ne hag gains in verbal condition- 
bei nunt escriptive behaviorists have 
tine a in rejoin that the relation- 
cial iid awareness _and per- 
Nis sas served by cognitively ori- 
icc searchers have arisen from 
ier associated with the verbal 
ow Mii ii from which aware- 
eh as inferred. Thus, in verbal 
iu. ang, broadly opposed theoret- 
bee ems lock horns: Descriptive 
ME argue for learning without 
dins ness, those of cognitive persuasion 
ah against it, and proponents of 
fel bows of view generate methodolog- 
ae iticisms of the experimental work 
rried on in the opposing camp.” 


2 
theori should be noted that S-R behavior 
not aoe in the ,Hull-Spence tradition are 
desde to this controversy. The model 
to ac ped by these theorists was designe 
Red ne for the behavior phenomena ex- 
huma; by nonarticulate organisms Or by 
Which 4 in simple learning situations in 
esses Eie operation of higher mental proc- 
conditioni minimal, for example, in, eyeli 
fetes ioning and rote learning. A major dit- 
behavic between the views of descriptive 
is cL and those of Hull and Spence 
ese the latter never claimed that their 
wera concepts would hold for complex 
Po al processes. In his discussion © e 
Stulates and methods of behaviorism nearly 


It would appear that radically dif- 
ferent  epistemologies underlie the 
theoretical differences between Skin- 
nerian and cognitive researchers. The 
epistemological issues, which revolve 
around the admissibility of conscious 
awareness as a desideratum for psy- 
chological science, have been discussed 
in detail elsewhere (Spielberger, 1965). 
Therefore, rather than restating these 
issues here, a simple analogy may 
better serve to illustrate how varying 
pretheoretical assumptions about the 
general nature and scope of science can 
lead to highly disparate approaches to 
a concrete scientific problem. 

It seems reasonable to conjecture 
that biologists of an earlier genera- 
tion who did not believe in protozoa 
probably contended at times that ob- 
jects too small to be observed by the 
naked eye were irrelevant to scientific 
analysis (of disease, for instance). At- 
tempts to explore the domain of the 
microscopic world might have been dis- 
dained by such observers as revealing 
only illusory effects traceable to light 
diffusion in the microscope, and so on. 
Similarly, we might expect that pres- 
ent-day psychologists who consider 
thoughts and hypotheses to be beyond 
the limits of “scientific” inquiry would 
not vigorously search for them in ex- 
perimental subjects. Furthermore, we 
should not be surprised to find that 


2 decades ago, Spence (1948, p. 76) noted 


that: $ L 
...in dealing with the more complex 
types of animal and human behavior, im- 

] responses, covert verbal 


plicit emotional 0 
responses and not easily observable recep- 


tor-exposure and postural adjustments will 
have to be postulated s e 


and that: 


It is in this realm of theorizing that the 


verbal reports of human subjects are likely 
to be of most use to the behavior theorist, 
for presumably these reports can be made 

ich to postulate the occur- 


the basis on whi ilate 
rence of these inferred activities. 
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such psychologists were uninterested 
and unskilled in evaluating cognitive 
phenomena which for them do not 
exist. On the other hand, early biolo- 
gists who believed in protozoa might 
occasionally have "seen „them when 
they were not there. Similarly, we 
might expect, as has been suggested by 
Farber (1963), that cognitively ori- 
ented verbal conditioning researchers, 
in their eagerness to attribute behavior 
to mediating conscious processes, would 
sometimes inadvertently suggest (or 
erroneously infer) awareness in ques- 
tioning subjects who show performance 
gains. 

One obvious implication of the above 
analogy is that competing scientific the- 
ories lead to different experimental pro- 
cedures and often to observational er- 
rors which support the theoretical pre- 
dilections of the investigator (Rosen- 
thal, 1963). In most cases, how- 
ever, such methodological differences 
and observational errors are eventually 
resolved since competing theories gen- 
erally lead to more sensitive experi- 
ments. Furthermore, good experiments 
contribute to the accumulation of a 
composite set of facts which facilitate 
the convergence of theoretical schools 
and the establishment of an organized 
body of scientific knowledge (Camp- 
bell, 1963). But, if the epistemolog- 
ical assumptions which underlie com- 
peting scientific theories differ, the 
methodological consequences of such 
differences may lead to the collection 
of noncomparable data about which 
pointless theoretical controversy is gen- 
erated. Unfortunately, this seems to 
be the case in verbal conditioning. 

In this paper, we propose to examine 
the relative merits of cognitive and 
Skinnerian interpretations of verbal 
conditioning and the methodological as- 
sumptions on which these interpreta- 
tions are based. The organization of 
the paper is as follows: In Section I 
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the use of verbal report measures as 
indexes of awareness (mediating cog- 
nitive processes) will be discussed, 
and the various objections which in- 
vestigators of a Skinnerian bent have 
raised concerning these measures will 
be enumerated and analyzed. In Sec- 
tion II methodological problems as- 
sociated with the assessment of aware- 
ness from postconditioning interviews 
will be considered, and procedures for 
assessing awareness during condition- 
ing will be described along with the re- 
sults of two experiments in which such 
procedures were utilized. A theoretical 
analysis of verbal conditioning in terms 
of cognitive concepts will be presented 
in Section III. In Section IV the 
significance of the verbal coriditioning 
paradigm for the more general question 
of learning without awareness will be 
taken up. 


I. Can VERBAL REPORTS BE INTER- 
PRETED AS INDEXES OF MEDIATING 
Coenitive Processes? 


Methodological criticism has been 
associated with attempts to condition 
verbal behavior since Thorndike’s early 
experiments in this area. In these 
Thorndike was interested in determin- 
ing whether the law of effect operated 
When the subject was unaware of the 
contingency of reinforcement.  Utiliz- 
ing a word-association task, Thorndike 
and Rock (1934) reinforced sequential 
aS opposed to denotative associations 
(ovER-the hill, as opposed to OVER- 
above). They reasoned that if subjects 
gained sudden insight (awareness) into 
the response-reinforcement contingency, 
the number of Sequential responses 
Which they gave would rise sharply. 
Hence, they regarded the gradual in- 
Crease in sequential responses, which 
they in fact obtained, as providing evi- 
dence for learning without awareness. 
This interpretation, however, was im- 
mediately challenged by the demonstra- 
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tion that similar gradual performance 
gains ensued even when subjects were 
informed of the basis of reinforcement 
VN Kauffman, Prior, & Weaver, 
d Te in perhaps an pniorkonate 
ommentary on the narrow, behavioris- 
tic zeitgeist then prevailing that the sim- 
ple procedure of questioning subjects 
Tere utilized in either of these studies. 
m et al. (1934), while they did not 
ili the need for measures of insight or 
ea that are operationally in- 
Pel ME of performance, helped at 
recentl ico the need evident. More 
has = the quest for such measures 
ter generally led to the use of sys- 
ange obtained verbal reports. 
"pies kinenan verbal conditioners 
Be Me pg that subjects should never 
that r D uh nor do they assert 
terai = ble relationships between 
Uns em eports and performance meas- 
this eg be found. They do argue 
fot TM when found do 
MEE nye adequate evidence that 
role How plays a causal intervening 
have owever, descriptive behaviorists 
ada a attempted to investigate the 
Thus X of verbal report procedures. 
gei their criticisms of them appear 
Stem chiefly from the implicit as- 
Eae that consciousness is in prin- 
nes HORUM ape that it is un- 
mol sary to consider it. These episte- 
‘logical biases are not often made 
i E but under. their influence 
m s of cognitive interpretations of 
inum conditioning phenomena are 
on methodological and theoretical 
Grounds. It is to such criticisms * that 
we now turn. 
o ese A rape have mot, been De 
hat folloy in detail in any gam — 
relevant aby eee an pides fosse eee 
al reports in "verbal conditi ag in terms 
of mediating stes. These have been adapted 
from E states. hese have ders 
y and unfriendly sources, inclu 


ing Dulany (1961, 1962), Eriksen (1958, 
0, 1962), Greenspoon (1962, 1963), Kan- 
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1. The first group of objections ad- 
vanced by descriptive behaviorists to 
the interpretation of verbal reports in 
cognitive terms is concerned with the 
possibility that awareness is suggested 
by the procedures which are employed 
to assess it. Questioning techniques 
used to evaluate awareness may, it is 
argued, differentially suggest the con- 
tingency of reinforcement to subjects 
who condition as compared to those 
who do not. When both the condition- 
ing task and the awareness interview 
are conducted by the same experi- 
menter, it is certainly plausible that the 
experimenter might inadvertently ask 
questions and/or record answers in 
such a way as to suggest and/or impute 
awareness to those subjects who showed 
performance gains. A related way in 
which the interviewer might arti- 
factually bring about a relationship be- 
tween performance and awareness is 
through cues provided by the wording 
of interview questions. Questioning 
subjects about their experiences during 
an experiment obviously leads them to 
reflect on their performance and pos- 
sibly also to develop hypotheses about 
the response-reinforcement contingency. 

2. A second general class of criticism 
contends that subjects condition with- 
out awareness, notice their increas- 
ing outputs of the critical response 
class, and then label the response-rein- 
forcement contingency. Inother words, 
it is argued that the subjects’ knowl- 
edge of the contingency emerges as a 
result of an automatic conditioning 
process, and those subjects who in- 
dicate in a postconditioning interview 
that their performance was mediated 
by their hypotheses are merely rational- 

i leagues (Kanfer & Marston, 
e an b & McBrearty, 1961), 
à Krasner and Ullmann 


(1962), ns 
(1963), and Sassenrath (1961), 


Postman © jelberger (1962, 1965 
Salzinger (1959), Spie vi : ), 


and especially Dulany 
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izing their performance gains. This 
line of criticism calls into question cor- 
respondences between performance and 
awareness based on measures taken 
after performance gains have ensued 
and points up the theoretical signifi- 
cance of the temporal sequence of 
events in verbal conditioning, that is, 
whether in fact performance gains 
precede awareness or occur subsequent 
to the subjects’ development of aware- 
ness. 

3. A more complicated variant of the 
point of view indicated in 2 above has 
been suggested by Postman and Sas- 
senrath (1961)* who state that “aware- 
ness as reflected in verbalization may 
represent an advanced stage in the de- 
velopment of a habit under conditions 
of reinforcement [p. 124]." They go 
on to suggest that the verbalization of 
a response-reinforcement contingency 
"may be considered at the same time a 
result of past improvement and a con- 
dition for futher improvement [p. 
124]." The tenability of this formula- 
tion, which ascribes a secondary media- 
tional role to awareness, rests upon the 
demonstration of significant amounts 
of learning prior to the subjects’ 
verbalization of a correct or partially 
correct contingency. 

4. Another line of criticism contends 
that awareness is merely a covert re- 
sponse which is conditioned over trials, 
in an automatic fashion, at the same 
time as the reinforced response class is 
strengthened. Postman and Sassenrath 
(1961), in discussing symbolic medi- 
ators, have posited this sort of mechan- 
ism to explain the acquisition of hy- 


4It should be noted that although Post- 
man and Sassenrath cite findings of verbal 
conditioning investigations as relevant to 
this interpretation, their conclusions are 
based primarily upon Thorndikian experi- 
ments in which gradual increments in per- 
formance typically precede the point of 
verbalization. 
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3. SECONDARY MEDIATION 
R---S4— RI— Sp — Cinare — Rf 


4. JOINT CONDITIONING EFFECTS 


Fic. 1. Schematic diagrams depicting non- 
cognitive interpretations of verbal reports of 
awareness in verbal conditioning. 


potheses in instrumental learning, They 
state: “The differential reinforcements 
administered for the overt instrumental 
responses will at the same time selec- 
tively strengthen the correct mediating 
responses [p. 132]." In this view, 
awareness is assigned the properties 
of an operant Tesponse, and it is hy- 
pothesized that covert verbalizations of 
the reinforcement principle (aware- 
ness) are directly and automatically 
strengthened by the same reinforce- 
ment process through which other in- 
strumental responses are conditioned.’ 
The formulations described above 
are presented as schematic diagrams in 
Figure 1. These diagrams are not 


5A similar interpretation is suggested by 
ei and McBrearty (1961) who contend 
at: 

+ verbalization of the response-rein- 
forcement contingency may be independ- 
ently „affected by different antecedent 
conditions than the acquisition rate of the 
(reinforced verbal) response class, even 
though experimental conditions may some- 
times facilitate an interaction between 

ese two events [p. 116]. 
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intended to represent particular view- 
Points in detail, but are offered only 
Den a „as an aid in expressing 
m pion el of verbal conditioning 
= : h consider verbal reports primarily 
eh ia rather than as indexes of 
the Pii states. In each schematic, 
front won _ sequence of events is 
e eft to right. Events which are 
den) temporally contiguous are in- 
Massi by dotted lines; presumed 
à al relationships between events are 
p ole by solid arrows. 
a iiio l illustrates the two pos- 
seest, i hereby awareness might be 
subject ed by questioning procedures to 
talhs UR showed performance 
subject e symbol R stands for the 
beler a verbalization of responses 
du d to the reinforced response 
albidis n refers to the reinforcing 
enie y administered by the exper 
ions or each R. Rf indicates per- 
athe a gains.. S, represents biasing 
o dia. might be transmitted un- 
dimen to the subject by an inter- 
Jeca r who had knowledge of the sub- 
as conditioning performance. So 
the = for possible cues conveyed. by 
lead ii. of questions which might 
lcs à bjects who showed performance 
aft to construct correct hypotheses 
er the fact by reflecting on their 
Performance. 
hea Nematic 2 diagrams the hypo- 
in cal automatic strengthening-label- 
"ial sequence in which subjects are sai 
E anion without awareness an 
rend to become aware of the response 
ES oo contingency after notic- 
d eir increasing output of reinforce 
ponses, Sp 4. 
5 Schematic 3 depicts the hypothetical 
oe of mediating processes sub- 
Ei emn to automatic conditioning. . s 
= cribed in Schematic 2, the subject 
in comes aware as a result of increments 
a ee m brought about by the 
omatic effects of reinforcement an 
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utilizes this awareness further to en- 
hance performance gains. In this dia- 
gram, RT symbolizes augmented per- 
formance consequent upon the subject 
becoming aware of the correct con- 
tingency. 

Schematic 4 illustrates the view that 
reinforcement acts simultaneously to 
condition both the reinforced response 
class and verbalization of the reinforce- 
ment principle. Presumably, reinforce- 
ment would operate in this paradigm to 
strengthen covert verbalizations of the 


correct response-reinforcement contin- 
gency, Ra. 

Underlying all of the above formula- 
tions is the epistemological viewpoint 
mentioned earlier. It would appear 
that what descriptive behaviorists ob- 
ject to basically is the use of verbal re- 

orts as a Trojan horse for the rein- 
troduction of private experiences into 
psychology. Since private experience 
does not meet positivistic canons of 
interobserver reliability, they argue 
largely on epistemological grounds that 
interest in it remains but a vestige of 
discredited introspectionism. How- 
ever, the heuristic value of operation- 
ally defined awareness variables in 
verbal conditioning has been demon- 
strated empirically, and the numerous 
instances in which such measures have 
been found to relate consistently and 
fully to other variables pro- 
vides strong justification for their use, 
irrespective of how they are interpreted. 
Over and above this pragmatic value, 
the utilization of awareness as a sys- 
tematic concept has led to experimental 
fndings which the Skinnerian ap- 
proach would not have predicted and 
can explain only with piecemeal aug- 
menting assumptions (Dulany, 1962; 
Spielberger, 1962). . 

In this paper we will contend that 
the descriptive behaviorists' rejection, 
on epistemological grounds, of verbal 
reports as indexes of mediating states 


meaning 
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has retarded the convergence of em- 
pirical findings and the development 
of adequate theory in verbal condition- 
ing. We will argue moreover that this 
general approach has contributed little 
to the understanding of verbal behavior 
because, as Guttman (1963, p. 119) has 
recently suggested with respect to per- 
ception : 

. the effort to speak the language of 
behavior only is just inefficient, and to do so 
leads to bad experiments . . . , to a truncated 
set of laws of behavior, and to wide mis- 
interpretation as to what our current knowl- 


edge of behavioral laws portends, theo- 
retically and practically. 


Having stated these goals and claims, 
we may now proceed to a discussion of 


procedures used for assessing aware- 
ness in verbal conditioning. 


II. THE Use or VERBAL REPORTS 
IN VERBAL CONDITIONING 


The questioning techniques employed 
to assess awareness in verbal condition- 
ing studies have included both oral 
interviews and written-response ques- 
tionnaires. These have been utilized 
both during and after conditioning 
trials. In this section, we will first 
consider methodological factors asso- 
ciated with the assessment of aware- 
ness from postconditioning interviews. 
We will then describe procedures de- 
veloped for assessing awareness during 
conditioning and the findings of two ex- 
periments in which such procedures 
were employed. Finally, the noncogni- 
tive formulations developed in Section 
I will be reviewed in the light of these 
studies. 


The Assessment of Awareness after 
Conditioning 


The assessment of awareness from 
postconditioning interviews is in- 
fluenced by the style of questioning, the 
wording of questions and the number 
of questions asked, the interpolation of 
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extinction trials, and the experimenter’s 
concern with correlated hypotheses. 
Each of these factors will be discussed 
below in terms of its potential effect on 
reports of awareness obtained in post- 
conditioning interviews. 

Oral interview versus written ques- 
tionnaire procedures. Both oral inter- 
views and written-response question- 
naires have been utilized for assessing 
awareness after the completion of con- 
ditioning trials. Either style of ques- 
tioning would appear to have certain 
characteristic advantages and disad- 
vantages. Written-response question- 
naires eliminate the potential biasing 
influence of the experimenter who con- 
ducts an oral interview. However, the 
responses elicited by this procedure 
may not be as complete as those ob- 
tained from questions asked and an- 
swered orally, and clarification of un- 
clear replies through additional inquiry 
is not generally feasible. Furthermore, 
some subjects may be inclined to avoid 
being incorrect by not responding when 
they are unsure of their hypotheses, 

Number and wording of questions. 
In early verbal conditioning studies, 
awareness interviews were very brief, 
and the questions were vaguely worded. 
For example, in one study (Sidowski, 
1954) the postconditioning interview 
consisted of only two questions: (a) 
"Were you aware of the purpose of 
the experiment?" and (b) “Were you 
aware of the purpose of the light?" 
(the contingent reinforcing stimulus). 
In response to these questions, a sub- 
Ject who might have been perfectly 
aware of the response-reinforcement 
contingency might not have had any 
idea of the “scientific purpose” of the 
experiment nor of the reinforcing stim- 
ulus, Furthermore, questions asked in 
this form make it easier to say “No” 
than “Yes” since the latter response 
will obviously require elaboration. A 
Series of questions in which subjects 
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were asked to state their thoughts about 
when and why the light was blinking 
would have been more satisfactory. 
Given minimal questioning, subjects 
tend to respond according to their own 
preconceptions about what is important 
and refer to the reinforcing stimulus 
only when asked about it. For ex- 
ample, after the conclusion of a pseudo- 
ESP verbal conditioning experiment, 
Krieckhaus and Eriksen (1960) held 
informal conversations with subjects 
who showed performance gains, but 
failed to verbalize a correct response- 
reinforcement contingency in a struc- 
tured interview. They reported: 


Two (subjects) verbalized very clearly their 
awareness of the effects of reinforcement 
upon their behavior. When asked why they 
had not stated this in response to the post- 
conditioning interview questions, both of 
them commented that they hadn't understood 
that this was what E was asking. They both 
stated that they thought E was concerned 
with whether or not they Were getting ESP 
messages, rather than trying to find out 
whether the reinforcement of “good” had 
affected their choice of responses [p. 515]. 


Thus, the requirement of asking for 
information with clear and adequately 
worded questions is coordinate with 
the necessity of asking a sufficient num- 
ber of questions. Jr 

Although more detailed questioning 
might be thought to increase the likeli- 
hood of suggesting awareness indis- 
criminately (Farber, 1963), a series of 
questions of gradually increasing Speci- 
ficity enables the interviewer to assess 
awareness and motivation while pro- 
viding minimal surplus information. 
Furthermore, in studies in which such 
interviews were employed (see Spiel- 
berger, 1962), it has been demonstrated 
that: (a) Substantial numbers of sub- 
jects were identified as aware of cor 
rect response-reinforcement contingen- 
cies who were not so classified on the 
basis of brief interviews; and (b) per- 
formance gains were limited essentially 


to aware subjects and specific to those 
responses for which the individual sub- 
ject was aware of a correct contingency. 

Extinction trials. In verbal condi- 
tioning, the practice of interpolating ex- 
tinction trials (e.g., Cohen, Kalish 
Thurston, & Cohen, 1954; Greenspoon, 
1955) or other tasks (Gergen, 1965; 
Sassenrath, 1959) between acquisition 
trials and the interview in which aware- 
ness is assessed probably reduces the 
likelihood that awareness will be de- 
tected. Subjects who are aware of a 
correct contingency during conditioning 
may either forget their hypotheses, in- 
terpret the withdrawal of the reinforc- 
ing stimulus during extinction as a sign 
that they were wrong, or both. We 
have observed that, even during ac- 
quisition trials, instances of accidental 
nonreinforcement due to experimenter 
error sometimes suffice to disconfirm 
subjects’ hypotheses (Spielberger & 
DeNike, 1963). 

Correlated hypotheses. The prob- 
lem of assessing correlated hypotheses 
in learning experiments was first raised 
by Postman and Jarrett (1952). In 
verbal conditioning, a subject is con- 
sidered to have a correlated hypothesis 
if he verbalizes a contingency which 
is different from, but correlated with, 
the reinforcement principle employed 
by the experimenter (Dulany, 1961). 
Thus, a subject need not be aware of 
the experimenter's definition of the 
contingency in order to show perform- 
ance gains through conscious pursuit of 
his own hypotheses. For example, sup- 
pose one spring a V shimsical millionaire 
were to emplace a few gold nuggets on 
some fertile ground on his estate and 
announce magnanimously to the towns- 
folk that all were welcome to come 
there and shovel around on the sur- 
face. Anyone inquiring among these 
townsfolk as they labored would find 
them highly aware that they were 
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“digging for gold" and highly unaware 
that they were "spading a garden.” 
In the traditional ges conditioning 
in which plural nouns are rein- 
forex (Greenspoon, 1955), a subject 
would receive 100% reinforcement if 
he said serially, “apples, , oranges, 
peaches,” etc. When interviewed, he 
might report as his hypothesis that re- 
inforcement is contingent upon his 
naming “fruits.” While the response 
class "fruits" is formally different from 
“plural nouns," the subject who named 
fruits would be reinforced for acting on 
this hypothesis and would be likely to 
develop the conviction that he was being 
encouraged to'say words of similar 
meaning, as Dulany (1961) has con- 
vincingly shown. "That subjects some- 
times develop and pursue correlated 
ideas in response to less than 100% 
reinforcement points up the importance 
of random reinforcement rather than 
no reinforcement for control groups in 
verbal conditioning experiments (De- 
Nike & Spielberger, 1963 ; Spielberger, 
DeNike, & Stein, 1965). And as 
Adams (1957) has pointed out, cor- 
related hypotheses are typically not 
evaluated in investigations which have 
reported evidence for learning without 
awareness. 

Thus far, we have discussed methodo- 
logical factors which influence reports 
of awareness obtained from interviews 
or questionnaires conducted after the 
completion of the conditioning task. 
As previously noted, in recent verbal 
conditioning experiments in which such 
factors were taken into account, per- 
formance gains were limited essentially 
to subjects who verbalized a correct or 
‘correlated contingency. However, in 
these experiments, awareness was in- 
ferred from responses to interview 
questions asked after performance 
measures had been taken. Therefore, 
it could be argued that the performance 
gains of aware subjects were auto- 
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matically produced by reinforcement 
and that the awareness-performance 
relationship could be accounted for by 
any of the noncognitive explanations 
described in Section I. The assess- 
ment of awareness during conditioning 
would provide a firmer basis for evalu- 
ating the temporal relationship between 
performance and awareness in verbal 
conditioning. Such procedures will 
now be considered. 


The Assessment of Awareness during 
Conditioning 


Most noncognitive explanations of 
verbal conditioning implicitly assume 
that increments in performance initially 
result from the direct and automatic 
effects of reinforcement and that per- 
formance gains occur prior to the time 
that the subject becomes aware of a 
correct response-reinforcement contin- 
gency. On the other hand, it is argued 
in cognitive explanations of verbal con- 
ditioning that awareness precedes per- 
formance increments. Thus, reinforce- 
ment theory and cognitive theory lead 
to differential predictions with regard 
to the temporal relationship between 
the development of awareness and the 
inception of performance gains. But 
in order to evaluate temporal factors in 
verbal conditioning, a procedure is re- 
quired which will permit determination 
of whether or not subjects become 
aware of a correct contingency during 
conditioning, and if so, when. The re- 
sults of two studies in which awareness 


was assessed during conditioning are 
discussed below. 


Experiment I. DeNike (1964) reinforced 
female college students for giving human- 
noun responses in a word-naming task 
(Matarazzo, Saslow, & Pareis, 1960). His 
subjects were required to write down their 
thoughts about the experiment” after each 
block of 25 response words. As a signal for 
the subject to record her thoughts, a light 
Was turned on which remained lit until the 
subject indicated she was ready to resume 
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ment, it would be expected that gradual 
increments in performance would occur for 
all subjects and that performance gains for 
aware subjects would occur prior to the 
word block on which they recorded a correct 
contingency in their notes. However, if 
of the reinforced response class 
onditioning is mediated by aware- 
ins would be expected 

and these should 
occur O! the word block 
on which such subjects recorded a correct 
The performance curves of 

Unaware, and Control 
are presented in Figure 2A. It may 
d that the output of human nouns 
for the Aware Group increased markedly 
reinforced word blocks and that 
and Control Groups failed to 
performance gains. However, al- 
though only subjects who recorded correct 
contingencies in their notes gaye more human 
nouns, it cannot be determined from the 
ta as arrayed in Figure 2A whether or 
not aware subjects showed increments in per- 
jor to the word block on which 
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Fic. 3. Conditioning curves. (A: Mean 
percentage of human-noun responses given 
by the Aware-Pre, Aware-Post, and Un- 
aware Groups on the operant word blocks 
and the reinforced word blocks prior to 
the incentive-inducing instruction. B: Mean 
percentage of human-noun responses given 
by the Aware-Pre, Aware-Post, Aware-Int., 
and Unaware-Int. Groups on the word blocks 
subsequent to the incentive-inducing instruc- 
tion. Adapted from Spielberger et al., 1966.) 


ship between performance gains and aware- 
ness, the conditioning data for the aware 
subjects were examined as a function of the 
word block on which each first recorded a 
correct contingency in her notes. This word 
block was designated the “0” block. Word 
blocks prior to and subsequent to Block 0 
were designated the preverbalizatio 
postverbalization blocks and were labeled 
respectively with negative and Positive inte- 
gers, after the practice of Philbrick and 
Postman (1955). The 0 blocks of the 
aware subjects were then aligned, and the 
data for the preverbalization and postverbali- 
zation blocks were separately Vincentized 
(Munn, 1950). The Vincentized condition- 
ing curve for the Aware Group is given in 
Figure 2B in which it may be noted that 
performance on the conditioning task: (a) 
remained at essentially the same level in the 
preverbalization blocks as in the operant 
blocks, (b) increased markedly on the 0 
block, and (c) was maintained at a relatively 
high level during the  postverbalization 


n and 
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blocks. The finding that the performance 
increments of aware subjects first occurred 
during the word block on which they first 
recorded a correct contingency in their notes 
would appear to indicate that their increased 
output of human-noun responses was cog- 
nitively mediated. The absence of pre- 
verbalization performance gains would ap- 
pear particularly difficult to explain by the 
use of learning theories which ascribe auto- 
matic trans-situational reinforcing effects to 
verbal stimuli. 

Although it was possible, according to a 
cognitive theoretical interpretation of verbal 
conditioning, for aware subjects in DeNike's 
study to give essentially 100% human-noun 
responses on each postawareness word block, 
it may be noted in Figure 2 that the mean 
number of such responses given by these 
subjects never exceeded 3596. Interview 
data obtained by DeNike after the conclu- 
sion of the conditioning task revealed some 
aware subjects for whom the reinforcing 
stimulus had no incentive value, and these 
subjects failed to show performance gains 
(DeNike, 1965). In contrast, aware subjects 
who reported that they wanted to receive 
reinforcement showed a marked increase in 
their output of human nouns on postaware- 
ness word blocks. In the next study to be 
reported, a uniformly high level of motiva- 
tion to receive reinforcement was induced 
during conditioning in all subjects. 

Experiment II. Using similar condition- 
ing procedures to those described for Ex- 
periment I, female college students were 
reinforced with “Mmm-hmm” for giving 
human-noun responses (Spielberger, Bern- 
stein, & Ratliff, 1966). Also, as in Experi- 
ment I, awareness of a correct response- 
reinforcement contingency was evaluated 
from the thoughts about the experiment 
(notes) which subjects recorded after each 
word block. However, the procedures dif- 
fered from those of the previous experiment 
in that, between the seventh and eighth rein- 
forced word blocks of the conditioning task, 
each subject was told: “As you may have 
noted, there is a rule under which I say 
‘Mmm-hmm’ Try to act on that rule so 
as to make me say ‘Mmm-hmm’ as often as 
you can." The purpose of this instruction 
Was to increase the incentive value of the 
reinforcing — stimulus, Immediately after 
conditioning a different experimenter con- 
ducted an interview which checked on the 
incentive manipulation and provided addi- 
tional data on subjects’ awareness. 

The notes which subjects recorded during 
Conditioning were rated by two judges who 
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had neither contact with the subjects nor 
knowledge of their performance on the con- 
ditioning task. The judges agreed perfectly 
in classifying cach subject as either aware 
or unaware of a correct contingency. For 
the subjects rated aware, the judges also 
indicated the word block on which a correct 
contingency was first reported. Those sub- 
Jects who recorded a correct contingency 
prior to the incentive-inducing instruction 
were designated the Aware-Pre Group; 
those who did so subsequent to this instruc- 
tion were designated the Aware-Post Group; 
subjects who failed to record a correct con- 
tingency during conditioning were called the 
Unaware Group. The mean percentage of 
human-noun responses given by the three 
groups in the operant blocks and the rein- 
forced blocks prior to the incentive-inducing 
instruction are given in Figure 3A in which 
the Aware-Pre Group's conditioning data 
have been Vincentized. Statistical analyses 
of these data indicated that: (a) The three 
groups did not differ during the operant 
blocks; (b) the performance gains of the 
Aware-Pre Group first occurred on the word 
block on which these subjects first recorded 
a correct contingency; and (c) the Aware- 
Post and Unaware Groups failed to show 
performance gains. Thus, prior to the in- 
centive-inducing instruction, aware and un- 
aware subjects performed much as did their 
counterparts in DeNike's study (see Fig- 
ure 2). 

Data obtained in the poste: 
terview indicated that the inst 
to make the experimenter say 
had the desired effect on the St 
vation to receive reinforcement. Almost al 
the subjects indicated that, as à consequence 
of the instruction, they wanted more for the 
experimenter to say «Mmm-hmm," and they 
iried harder to make him say it The per- 
formances of the Aware-Pre, the Aware- 
Post, and Unaware Groups in the word 
blocks subsequent to the instruction are 
indicated in Figure 3B. The immediate 
marked increase in the number of human 
nouns given by the Aware-Pre. Group on 
Block 8 suggested that the incentive manipu- 
lation increased the motivation of these sub- 
jects to receive reinforcement and that their 
prior awareness of a correct contingency 
permitted them at once to give more human- 
noun responses. The somewhat less marked 
increase in human nouns for the Aware- 
Post Group subsequent to the instruction 
may be interpreted as indicating that these 
subjects, most of whom recorded a correct 


contingency in their notes On Block 8, had 


onditioning in- 
ruction to try 
*Mmm-hmm" 
1bjects' moti- 


to become aware before heightened motiva- 
tion induced by the instruction could influ- 
ence their performance. Consistent with 
this interpretation, examination of the con- 
ditioning data for individual subjects in the 
Aware-Post Group revealed that initial 
performance increments tended to occur on 
the word block on which a correct contin- 
gency was first recorded. 

The mean percentage of human-noun re- 
sponses given by the Unaware Group in- 
creased gradually in the postinstruction word 
blocks, from 15% on Block 8 to 30% on 
Block 10 (not shown in Figure 3B), sug- 
gesting that these subjects conditioned with- 
out awareness. In order to evaluate this 
possibility further, the postconditioning inter- 
view protocols of all subjects who failed to 
record a correct contingency in their notes 
were examined by two judges who had 
neither contact with the subjects nor knowl- 
edge of their performance on the conditioning 
task. The judges agreed perfectly in rating 
approximately half of these subjects as 
aware of a correct contingency solely on the 
basis of their verbal reports in the interview. 

The performance of subjects rated aware 
on the basis of their interview responses, but 
unaware on the basis of their notes (Aware- 
Int. Group), is compared, in Figure 3B, 
with the performance of subjects rated un- 
aware on the basis of both their interview 
responses and their notes (Unaware-Int. 
Group). For the latter group, the output 
of human-noun responses subsequent to the 
instruction did not increase. Indeed, by the 
final word block, the number of human nouns 
given by the Unaware-Int. Group was below 
what it had been prior to the instruction. 
In contrast, the conditioning curve for the 
Aware-Int. Group showed a significant rise 
subsequent to the instruction, and perform- 
ance increments for individual subjects 
tended to correspond with the word block 
on which they claimed they became aware. 
Furthermore, several of these subjects spon- 
taneously reported that they had not recorded 
a correct response-reinforcement contingency 
in their notes because they did not become 
aware of it until the final word block and 
did not have sufficient confidence in their 


hypothesis to report it. 

In sum, the findings in Experiments I and 
II are consistent with the hypothesis that 
"what is learned" in verbal conditioning is 
awareness of a correct response-reinforce- 
ntingency. These findings would also 
appear to indicate that the reinforcing stimu- 
lus in verbal conditioning has both informa- 
tion and incentive value (Dulany, 1962), 


ment co! 


318 CHARLES D. SPIELBERGER 


and that the latter influences the degree to 


which subjects who are aware of a correct 
contingency act on their awareness. The 


i i ncognitive explana- 
uacy with which noi 
one of verbal conditioning can account for 


the results in these experiments is consid- 
ered below. 


Noncognitive Interpretations of the 
Findings in Experiments I and II 


The findings in Experiments I and 
II would appear difficult to account for 
in terms of a theory which does not 
include a concept of awareness as a 
mediating cognitive state or process. 
In these investigations, only subjects 
who were judged to be aware of a 
correct response-reinforcement contin- 
gency showed performance gains, and 
these tended to correspond with the 
word block on which each aware sub- 
ject first recorded a correct contingency. 
However, the question remains whether 
the obtained relationships between per- 
formance gains and operational indexes 
of awareness can be explained without 
recourse to cognitive variables. The 
possibility that these relationships arose 
artifactually will now be examined in 
terms of each of the noncognitive for- 
mulations described in Section I (see 
Figure 1). 

Suggestion. The possibility that 
awareness might have been suggested 
to subjects who showed conditioning 
through cues provided by a biased ex- 
perimenter was practically eliminated 
in Experiments I and IL. In these 
studies, awareness was inferred pri- 
marily from notes recorded during con- 
ditioning, and these notes were not 
written in response to potentially bias- 
ing questions. The signal for the sub- 
ject to record her thoughts, a light 
presented at uniform intervals, was a 
completely impersonal nonverbal stim- 
ulus which was in no way contingent 
upon the subject's conditioning per- 
formance. For those subjects in Ex- 
periment II whose awareness was 
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inferred from their responses to the 
postconditioning interview, the experi- 
menter who conducted this interview 
had neither prior contact with the sub- 
jects nor knowledge of their perform- 
ance on the conditioning task. 

Labeling. The findings in Experi- 
ments I and II do not support the 
interpretation that reinforcement auto- 
matically produced increments in per- 
formance and that the subjects’ knowl- 
edge of a correct response-reinforce- 
ment contingency subsequently resulted 
from their noticing and labeling the re- 
lationship between the critical response 
class and the reinforcing stimulus. As 
may be noted in Figures 2B and 3A 
the performance gains of aware sub- 
jects first occurred on the word block 
on which these subjects recorded a 
correct contingency in their notes. 
Thus, the reinforcing stimulus had no 
apparent influence on performance prior 
to the subjects becoming aware of a 
correct contingency. And while it still 
might be argued that automatic rein- 
forcement effects produced performance 
gains during the 0 word block, the 
absence of performance gains on the 
preceding word blocks would remain 
unaccounted for. Only the performance 
of the small group of subjects (N — 4) 
in the Aware-Int. Group of Experi- 
ment II might be interpreted as con- 
sistent with the labeling hypothesis. 
But even these subjects, who failed to 
record a correct contingency in their 
notes but reported it in the postcondi- 
tioning interview, tended spontaneously 
to explain this discrepancy in terms of 
insufficient opportunity to test their 
hypotheses. 

Secondary mediation. The second- 
ary-mediation hypothesis assumes that 
automatic performance increments pre- 
cede the development of awareness of 
a correct contingency, and that aware- 
ness then facilitates further perform- 
ance gains. This view is contradicted 
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by the same findings in Experiments I 
and II which led us to challenge the 
labeling interpretation. The demon- 
stration of automatic reinforcement ef- 
fects prior to the verbalization of 
awareness would appear crucial in 
order to support the hypothesis of sec- 
ondary mediation. No such evidence 
was found, except possibly for the sub- 
jects in the Aware-Int. Group in Ex- 
periment II. Thus, at the very least, 
the | secondary-mediation hypothesis 
would require extensive modification 
if it is to account for all of the findings 
in Experiments I and i 
_ Joint conditioning effects. Accord- 
ing to the joint conditioning hypothesis, 
gradual gains in performance on the 
conditioning task would be expected 
along with the simultaneous strength- 
ening of the covert “awareness Te- 
sponse.” The absence of performance 
increments in Experiments I and 

prior to the word block on which aware 
Subjects recorded a correct contin- 
gency, and the striking temporal cor- 


respondence of the inception of per- 
formance gains with the verbalization 
dictate 


of awareness would seem to 
the additional assumption of a “thresh- 
old” concept in order to support the 
view that the reinforced response class 
and a covert awareness response 
were simultaneously and automatically 
strengthened. Moreover, it would also 
appear necessary to assume that the 
threshold at which awareness was ver- 
balized corresponds to the threshold at 
which the cumulative effects of rein- 
forcement influence performance on the 
conditioning task, and that performance 
and covert awareness responses were 
practically identically susceptible to 17 
fluence by reinforcement in each indi- 
vidual subject. While such assump- 
tions might provide tenuous post-hoc 
explanations for the findings obtained 
in Experiments I and II, the conclu- 
Sion appears inescapable that a theory 
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Fic. 4. Schematic diagram indicating the 
hypothetical processes which are assumed 
to mediate performance gains in verbal con- 
ditioning for subjects who show “condition- 


ing" effects. 


of automatic strengthening of verbal re- 
sponse by reinforcement requires ex- 
tensive modification and radical aug- 
mentation before it can account for 
such findings. In the next section, a 
theoretical analysis of verbal condi- 
tioning in terms of the concepts of cog- 
nitive learning theory will be proposed. 


III. AN ANALYSIS OF VERBAL 
CONDITIONING IN TERMS OF 
CocxiriVE. CONCEPTS 


The schematic diagram presented in 
Figure 4 reflects a general formulation 
of the sequence of hypothetical events 
which we believe mediate performance 
gains in verbal conditioning for sub- 
jects who show "conditioning" effects. 
The diagram is not intended to repre- 
sent any particular cognitive theory in 
detail. Rather, it is offered to account 
for the findings of recent investigations 
of verbal conditioning, such as those 
described in Section II, at the same 
level of specificity as the noncognitive 
formulations presented in Figure 1. 
A similar more detailed and sophisti- 
cated theoretical analysis of verbal con- 
ditioning has been recently advanced 
by Dulany (1962). 

The temporal sequence of events in 
Figure 4 is from left to right. Those 
events which are merely temporally 
contiguous are indicated by dotted 
lines; presumed causal relationships 
between events are represented by solid 
arrows. The symbol Ra indicates that 
in verbal conditioning the subject ini- 
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i mits a variety of responses, some 
di Sch belong to the reinforced class, 
R., and some of which do not, R 
These are followed respectively by re- 
inf ant, Sg, and nonreinforce- 
inforcement, : 

ment, Sg-, which convey differential in- 
formation and provide differential in- 
centive to the subject. — 

The information provided by the re- 
inforcing stimulus gives rise to one or 
more cognitive states, represented as 
As Ax s Ay. For example, the 
sequence of cognitive states developed 
by a subject in a word-naming verbal 
conditioning task might be: "The ex- 
perimenter sometimes says 'good' after 
some of my words" (A); “Perhaps 
his saying 'good' depends on what 
words I say" (Ag) ; “Perhaps he wants 
me to figure out what words he is say- 
ing 'good' to, and give more of those" 

(As), etc. Depending upon the pat- 
tern and amount of reinforcement 

which they receive and recall, some sub- 
jects may be able to skip some of these 

early logical steps and go directly to a 
tentative formulation of the correct 

hypothesis, Ay: “Perhaps he is saying 
‘good’ when I say words denoting 
people.” We have observed that sub- 
jects who verbalize a correct hypothesis 
approach the experiment essentially as 
a problem-solving task. Furthermore, 
in verbal conditioning experiments in 
which subjects are given explicitly a 
problem-solving set, they proceed more 
directly to test out their hypotheses, 
and a larger proportion verbalize a 
correct contingency (DeNike & Spiel- 
berger, 1963; Spielberger & Levin, 
1962). 

Concurrent with the concept-forma- 
tion activity involved in arriving at a 
correct hypothesis, subjects also de- 
velop affective and conative states 
based on the incentive provided by the 
reinforcing stimulus. For example, if 
the reinforcing stimulus has positive 
incentive value for a particular subject, 
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he will want to receive reinforcement, 
and generally will /ry to elicit it from 
the experimenter. On the other hand, 
if the reinforcing stimulus has neutral 
or negative incentive value for the 
subject, he may be indifferent to it or 
attempt to avoid it (Mandler & Kap- 
lan, 1956). Thus, the subject’s affec- 
tive and conative states in conjunction 
with his cognitive states (hypotheses) 
lead him to give responses, R,, which 
he believes are in the reinforced re- 
sponse class, intermixed with a few re- 
sponses, R, which are not expected to 
elicit reinforcement. Following these 
responses, the selective administration 
of the reinforcing stimulus, Sn, or the 
absence of the reinforcing stimulus, Sr , 
either confirms the subject's hypothesis, 
Ac, or leads him to modify it. The 
confirmed hypothesis, if accompanied 
by appropriate affective and conative 
states, results in increased output of the 
reinforced response class, Rf. 

It should be noted that the schematic 
diagram in Figure 4 massively over- 
simplifies our view of verbal condition- 
ing. In it, we have treated affective 
and conative states as essentially con- 
stant and independent of cognitive 
states; there is much reason to suspect 
that these fluctuate and interact with 
the subjects’ hypotheses. We have 
diagramed the process for the suc- 
cessful subject and ignored the numer- 
ous blind alleys explored by both suc- 
cessful and unsuccessful subjects; such 
Processes should receive consideration 
in any theory which claims complete- 
ness. The nature of the conditioning 
task, the status of the experimenter, 
the instructions given to the subject, 
the characteristics of the subject pop- 
ulation—all of these have demonstrable 
influence on mediating states and per- 
formance in verbal conditioning. In 
the final section of this paper, the sig- 
nificance of the verbal conditioning 
paradigm for the more general ques- 
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tion of human learning without aware- 
ness will be considered. 


IV. VERBAL CONDITIONING AND 
THE QUESTION oF LEARNING 
WITHOUT AWARENESS 


Few persons doubt that behavior 
occurs without awareness, as in, for 
example, shifting gears, fingernail bit- 
ing, facial tics, etc. (Eriksen, 1960). 
Of theoretical interest, however, is the 
extent to which such behaviors are in- 
itially learned without the mediation of 
Cognitive processes. Kimble (1962) 
has suggested that the question of 
whether learning without awareness is 
possible may not have the same an- 
swer for all forms of learning. In 
classical eyelid conditioning, according 
to Kimble, it would appear that the 
subject need only be aware of the un- 
conditioned stimulus for learning to 
take place. 

In verbal conditioning, the evide 
for learning without awareness reported 
in early experiments (¢.g-, Cohen et al., 
1954 ; Greenspoon, 1955; Taffel, 1955) 
now appears suspect because of short- 
comings in the evaluation of awareness 
in these studies. In recent experiments 
in which awareness was more carefully 
and thoroughly assessed (e£ DeNike, 
1964; Dulany, 1961, 1962; Spielberge", 
1962), the absence of learning-without- 
awareness effects all but compels the 
Conclusion that performance gains in 
verbal conditioning are consciously 
mediated. However, for the reasons 


nce 


9It should be noted, however, that some 
Subjects who are unaware ofa correct con- 
tingency may nevertheless show slight gains 
in performance in verbal conditioning if they 
develop what may be termed à corrclated 
set. For example, subjects (patients) who 
regard the Taffel (1955) sentence-construcé 
tion task as a personality test sometimes 
assume they are supposed to make SP sen- 
tences about themselves and their families. 
n response to either rando ystematic 


m or SY 
reinforcement, subjects with such response 


indicated below, the verbal condition- 
ing paradigm may not provide an ap- 
propriate framework for clarifying the 
more general question of whether hu- 
man operant learning can occur with- 
out awareness. 


Subject Vigilance 


Most subjects in psychological ex- 
periments, as Orne (1962) has pointed 
out, tend to be alert to manipulations 
that the experimenter may impose upon 
them and vigilant vis-à-vis their own 
behavior. In almost all verbal condi- 
tioning research, the subject is aware 
that he is participating in a psychologi- 
cal experiment. Thus, subjects in ver- 
bal conditioning experiments ordinarily 
notice the reinforcing stimulus and at- 
tempt to ascribe some meaning to it, 
making learning without awareness un- 


likely. 
Response Monitoring 


Despite reinforcement and no ad- 
monition to the contrary, reinforced 
words are rarely repeated in word- 
naming verbal conditioning tasks. This 
almost universal tendency not to repeat 
words would be difficult to explain 
by automatic reinforcement theories. 
When asked why they did not repeat 
reinforced words, subjects usually in- 
dicate that they believed they were 
supposed to give a variety of words. 
It would appear that the subject keeps 
both his past responses and their con- 
sequences in mind and monitors new 
responses according to his hypotheses 
about the experiment. Thus, on the 
assumption that selective responding is 
guided by response monitoring in ac- 
cordance with the subjects’ hypotheses, 


tions may interpret the reinforcing 
s as a sign of the experimenter's 
approval or encouragement and therefore 
give increasingly more sentences beginning 
with “I” or “We” (Spielberger, DeNike, & 


Stein, 1965). 


disposi 
stimulu: 
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to repeat reinforced 
words in verbal conditioning um 
understandable, and learning without 
awareness seems improbable. 


the tendency not 


Response Sets 

Some subjects approach verbal con- 

ditioning with a problem-solving set, 
while others interpret the conditioning 
task as either a personality test or an 
attempt to make them “conform.” 
When a problem-solving set is com- 
bined with response monitoring in a 
vigilant subject, the subject notices the 
reinforcing stimulus, proceeds to test 
out successive hypotheses about the ex- 
periment, and is likely to become aware 
of a correct or correlated response-re- 
inforcement contingency. In contrast, 
subjects with a personality-test set may 
get so involved in their free associations 
(on a word-naming verbal conditioning 
task) that they are oblivious to the re- 
inforcing stimulus or tend to regard it 
as annoying and disruptive. Other 
subjects with personality-test sets de- 
vote considerable effort to the selection 
of “innocuous” or “unrevealing” words 
or otherwise defensively edit their re- 
sponses so that they will not reveal 
personal secrets or shortcomings. Sub- 
jects who believe the experimenter is 
attempting to manipulate their behavior 
often report that they deliberately ig- 
nored the reinforcing stimulus or tried 
to avoid it. Thus, while a “conform- 
ity” or “personality” interpretation of 
the verbal conditioning task tends to 
prevent learning without awareness by 
causing subjects to ignore or avoid the 
reinforcing stimulus and/or to perceive 
it as unpleasant, a problem-solving ap- 
proach tends to preclude learning with- 
out awareness by making it easier for 
the subjects to become aware. 

The foregoing analysis of the “de- 
mand characteristics” (Orne, 1962) of 
verbal operant conditioning suggests 
that it is unlikely that learning without 
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awareness will be found in laboratory 
experiments which employ the methods 
traditionally associated with the verbal 
conditioning paradigm. Under such 
conditions, it might be expected on a 
priori grounds that subjects will be 
particularly vigilant about their own 
verbal responses which are likely to be 
more carefully monitored than most 
other behavior. Or, in Eriksen's 
(1960) words, “. any situation 
where the cues and the reinforcements 
are salient enough to produce learning, 
will not escape detection by awareness 
[p. 298].” 

It would appear more plausible to 
seek evidence of learning without 
awareness in situations where subjects 
are not aware that they are participat- 
ing in a psychological experiment and 
on tasks which do not induce intensive 
response monitoring or defensive re- 
sponse sets. A likely place to look for 
such evidence might be in settings 
where subjects are motivated not to be 
aware, as, for example, in experimental 
situations involving processes such as 
repression, dissonance reduction, con- 
formity, and ingratiation. At present, 
only scattered attempts have been 
made to investigate human operant 
learning with research designs repre- 
sentative of some of the forenamed con- 
ditions (e.g, Centers, 1963; Gergen, 
1965; Goldiamond, 1965; Hefferline, 
Keenan, & Harford, 1959; Kimmel & 
Baxter, 1964; Verplanck, 1956; Vogel- 
Sprott, 1964). In such studies, how- 
ever, the role of awareness has ordi- 
narily not been the focus of investiga- 
tion, and, consequently, the methods 
employed to assess awareness are sub- 
Ject to the criticisms discussed earlier. 
Therefore, where learning-without- 
awareness effects have been found, little 
confidence may be placed in such re- 
sults. This in no way implies that hu- 
man operant learning cannot take place 
without awareness, but merely affirms 
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Adams’ (1957) observation that such 
learning has not been demonstrated 
convincingly in the laboratory. 


Some Final Considerations 


We have endeavored in this paper to 
compare the relative adequacy with 
which descriptive behaviorism and cog- 
nitive learning theory can account for 
the findings of verbal operant condi- 
tioning experiments. We have argued 
that the descriptive behaviorists! im- 
plicit rejection of awareness as à con- 
cept has had serious methodological 
consequences which have retarded the 
convergence of empirical findings and 
the development of adequate theory. 
We have also contended that the uti- 
lization of awareness as 4 systematic 
concept has led to experimental find- 
ings which the Skinnerian approach 
would not have predicted. Finally, we 
have presented evidence to support the 
view that the reinforcing stimulus in 
verbal conditioning has both informa- 
tion and incentive value and that “what 
is learned” in verbal conditioning is 
awareness of a correct (or correlated 
response-reinforcement contingency. 


With respect to whether human op- 


erant learning can occur without aware- 
hat although 


ness, it should be noted t 
this question would appear to have an 
empirical answer, in verbal condition- 
ing the answer turns out to be in- 
extricably tied to an investigator's 
theoretical orientation and, more funda- 


mentally, to his epistemological as- 


sumptions concerning awareness as a 
1962, 1965). 


concept (Spielberger, 7 

The issue is further complicated by the 
fact that different learning xo im; 
ply different operational definitions © 
the learned response (Campbell, 1954), 
leading in verbal conditioning to the 
collection of noncomparable data and to 


fruitless theoretical controversy. This 


controversy, Which revolves about the 
role of awareness in verbal condition- 
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ing, is reminiscent of earlier disagree- 
ments among S-R and cognitive the- 
orists concerning the role of reinforce- 
ment in learning. It has since been 
recognized that much of the contro- 
versy and confusion with respect to 
reinforcement stemmed from the failure 
of learning theorists to differentiate 
conceptually between the law of ef- 
fect as an empirical statement and 
as a general theory of reinforcement 
(Spence, 1951). 

On an empirical level, most would 
agree with the observation that per- 
formance in a variety of tasks is fa- 
cilitated by reinforcement, but S-R and 
cognitive learning theories are still 
sharply divided on the question of 
whether reinforcement is required for 
learning to take place. Similarly, it is 
generally accepted that performance 
Will be facilitated by the presence of 
appropriate cognitive states; that is, an 
“empirical law of cognition” is as sup- 
portable as the empirical law of effect,’ 
but whether cognitive processes medi- 
ate any and all forms of learning is as 
dubious as a general theory of rein- 
forcement. Thus, while a “theoretical 
law of cognition" might well serve to 
explain the findings in verbal condition- 
ing experiments with human adults, it 
is quite another matter to demonstrate 
the role of cognitive processes in the 
learning of animals and preverbal chil- 
dren. 

Nevertheless, the careful investiga- 
tion of subjects' awareness in relation 
to other behavior is, we believe, requi- 
site to the development of an adequate 
theory of human learning. Progress in 
this direction will depend upon the 
general acceptance of experimental pro- 
cedures which permit sensitive evalua- 


| to Donald L. Thistle- 
ng the analogy between 
an empirical law of cogn 


ition in verbal con- 
ditioning and the empirical law of effect in 


traditional discussions of learning theory. 


7 We are grateful 
thwaite for suggest! 
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tion of cognitive processes. These pro- 
cedures must surely include subjects 
verbal reports and must provide neces- 
sary safeguards against biasing or dis- 
torting such reports. Furthermore, it 
must be clearly recognized that the 
validity of verbal reports in any experi- 
mental context rests upon the willing- 
ness and linguistic competence of sub- 
jects to describe their mediating states 
when properly questioned (Dulany, 
1961), and that verbal reports are but 
imperfectly related to the subjects' in- 
ternal states (Eriksen, 1960). Para- 
doxically, perhaps the most significant 
contribution of verbal conditioning re- 
search has been the stimulation of in- 
terest in verbal report procedures and 
in concepts such as awareness among 
psychologists who are inclined to insist 
that thoughts and ideas are beyond the 
limits of scientific inquiry. 
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Expectancies of the form *"aa'Eo" (ie. “Act a rather than Act a' is 
expected to lead to Outcome o”) are diagnosed by positive association 
between the relative frequencies of a pair of alternative acts, à and a’, 
and a pair of opposed preference states, oPo' and o'Po. Controls to 


exclude artifactual association due to response biases are described. 
Properties of such expectancies and experimental problems that arise 


^ 


in diagnosing them are discussed, and compariso: 
izations are made. 


, Although the concept of expectancy 
is central to "cognitive" theories of be- 
havior, proponents of such theories 
may be said to have had more success 
in using it polemically than in analyzing 
it and setting forth its empirical cri- 
teria. What follows is an attempt in 
the latter direction. 


EMPIRICAL CRITERIA OF Acr- 
OUTCOME ExpECTANCIES 
s, the fundamental 


notion of cognitive theory is that an or- 
ganism’s choices are determined by 
what it wants and by what it expects. 
If so, then to know what it expects, we 
must know how its choices vary with 
what it wants. To illustrate : 


1. Suppose that a child confronted 
plue one 


with two boxes opens the 
when he prefers a toy to a trinket and 
opens the yellow one when he prefers 
a trinket to a toy. Then @ cognitive 


theorist will conclude that the child ex- 


In simplest term: 


1 This investigation was supported in part 
-06580 to the Uni- 


Public Health Service. 
ful to R. L. Solomon, R. 
Joan Gay Snodgrass for 

The following terms will be used in 
senses given them in previ 

1958, 1961): act, outcome, ifferential ou 
comes, null differential outcome (ø), pref- 
crence, discrimination. 


pects that opening the blue box rather 
than the yellow one will get him a toy 
and that opening the yellow box rather 
than the blue one will get him a trinket. 
2. If a rat in a T maze turns right 
food to water and turns 
fers water to food, then 
according to cognitive theory it expects 
turning right rather than turning left 
to lead to food and turning left rather 
than turning right to lead to water. 


The paradigm appropriate to test- 
ing for such act-outcome expectancies 
is represented in Table 1. Let a and a’ 
be any pair of alternative acts of which 
one and only one occurs in any trial, 


when it prefers 
left when it pre 


TABLE 1 


PARADIGM FOR EsTABLISHING EXISTENCE OF 
THE EXPECTANCIES 


aa! Eo, a'a Eo, aa' Eo", AND a'aEo 
Conditions 
Alternative 
acts 1 2 v 2 
oPo' o'Po oPo' o'Po 
F = ES 2 
a 
(o) (0) (o^) (o) 
= + + e 
a 


(o) (o) (o) (0) 


Note.—0, 0": differential outcomes; oPo', o'Po: op- 
posed preference states. Plus and minus signs indicate 
association between acts and preference states. 
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and let o and o' be any pair of differen- 
tial outcomes (Irwin, 1961). In Con- 
dition 1, o is preferred to o (oPo') ; 
in Condition 2 the reverse obtains 
(o'Po). Then a positive association 
between the relative frequencies of Acts 
a and a’ and the preference states oPo 
and o'Po, as suggested by the plus and 
minus signs in the cells of the table, 
is evidence for the existence of two 
expectancies, namely, that Act a rather 
than Act a' will lead to Outcome o, and 
that Act a’ rather than Act a will lead 
to Outcome o'. These two expectan- 
cies may be symbolized by aa'Eo and 
a'aEo', respectively, and o and o' may 
be called their expectanda. 

It is possible, however, for such re- 
sults to reflect merely an artifact of re- 
sponse bias. For example, if a bias 
‘favoring a over a’ is for some reason 
associated with oPo' but not with o'Po, 
then Conditions 1 and 2 will show an 
association between the acts and the 
preference states, but inferences con- 
cerning expectancies will be unwar- 
ranted. The experimenter may know 
his organism well enough not to be 
worried by this possibility, but to 
eliminate it experimentally requires 
only the control procedures of Condi- 
tions 1’ and 2’, which attempt to re- 
verse the direction of the association 
found in Conditions 1 and 2. Tf this is 
successful, not only is the artifact ruled 
out, but the existence during the con- 
trol trials of two different expectancies, 
aa Eo' and a'aEo, is demonstrated. 
Such control procedures are often car- 
ried out as a matter of course, although 
sometimes with subjects (Ss) other 
than those of Conditions 1 and 2 (see, 
for example, Kendler, 1946). 

Although Table 1 constitutes the par- 
adigm for establishing whether or not 
certain expectancies exist at the time of 
testing, it implies nothing about how 
they may have been acquired. One 
could accept from a colleague a group 
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of Ss—rats, children, monkeys, adults 
and, in complete ignorance of their 
past, still undertake to fnd out whether 
they had some specified expectancy. 
This expectancy might have been in- 
duced by training Ss with repeated 
trials in a design like that of Condi- 
tions 1 and 2 of Table 1, the outcome 
of each trial being what is represented 
in parentheses in the cells of the table. 
Or, in the case of organisms capable 
of language, Ss might have been told 
what to expect as the result of their 
choices in the situation. Or Ss might 
have been exposed to the sight of other 
Ss making choices in the situation and 
obtaining the outcomes from them, as 
in experiments on “imitation.” What 
is needed is not information about this 
history but rather control over Ss’ 
preferences ; with this, the expectancies 
can be diagnosed if they exist. 


COMMENTS AND DISCUSSION 
Formal Properties 


The following properties of the ex- 
pectancies under discussion will be 
postulated. 


1. Since a and a’ are taken to be al- 
ternative acts in a binary-choice situ- 
ation, they are nonidentical and mutu- 
ally exclusive. Hence, the expression 
aaEo is not defined. 

2. At any moment in time, with o 
held constant : If aa' Eo, then not a'aEo; 
that is, aa Eo is asymmetrical in a and 


1 


d. But (not aa’Eo) does not imply 
a'aEo. 

3. At any moment, with o held con- 
stant: If aa'Eo and a'a"Eo, then aa"Eo; 
that is, the relation between the acts is 
transitive. 

In discussing the critical experi- 
ment required for establishing that an 
expectancy of the type aa'Eo exists, it 
was pointed out that obtaining the evi- 
dence for one such expectancy entails 
obtaining evidence also for another ex- 


p 
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pectancy closely related to the first. 
This linkage can be expressed sym- 
bolically : 


(a)(a') (Go)aa’Eo= o')a'aEo".. (1 


That is to say: Given any pair of al- 
ternative acts, to assert that an organ- 
ism expects one act rather than the 
other to lead to some outcome, 0, is 
material equivalent to asserting that 
there is some other outcome, o', such 
that the organism expects the second 
act rather than the first to lead to it. 

The freedom to choose o' in Formula 
1 is material. For example, one of the 
expectancies in Illustration 2 was that 
turning right rather than turning left 
would lead to a food outcome. This 
very same expectancy could be estab- 
lished in the same situation by chang- 
ing o' from the water outcome of turn- 
ing left to any other feasible differential 
outcome such as social contact with 
another rat, sexual commerce with a 
rat of the other sex, return to the home 
cage, etc. 


Ex pectancies as Relations 


Expectancies of the form aa'Eo are 
relation be- 


not composed merely of a 
tween a single act and it$ outcome ; 
they involve a relation. between two 
act-outcome sequences with a common 
outcome. Roughly speaking, an organ- 
ism having such an expectancy may be 
said to relate the act-outcome sequence 
a leads to o" to the Sequence e 
leads to o" and to behave as if the for- 
mer were more likely than the latter to 


be true, Earlier discussions of expect- 
ancies have not brought this relationa 
into focus 


character of the concept int 
is true that common speee 


the relation to go unexpressed, but, al- 
though one may say, “J expect to arrive 


in time if I take the train,” it is clear 
that some alternative act is to be un- 
derstood, such as, “rather than take à 


bus" or "rather than not taking the 


train” or the like. It should be noted 
that the alternative act is not neces- 
sarily the simple negative of the origi- 
nally specified act. : 
If an organism can form such ex- 
pectancies, it must also be able, at least 
implicitly, to form rules about temporal 
sequences of acts and outcomes. Such 
a supposition is natural to cognitive 
theory, the spirit of which is to under- 
stand the organism’s behavior as de- 
termined in part by its knowledge of 
nature—at least, of that part of nature 
commonly referred to as its environ- 
ment. It is also presumably meaning- 
ful to conceive of expectancies that do 
not refer to act-outcome sequences, 
such as the expectancies that it will rain 


tomorrow, that a friend will arrive on 


the five o'clock train, that a shock will 


be felt soon after a buzzer is heard, etc., 
but no attempt to deal with these will 


be made here. 


Reversal of Preference 

No doubt the reader will have felt 
that the requirement of experimentally 
reversing the preference relation in or- 
der to demonstrate expectancies is 
easier to demand than to accomplish. 
This feeling is justified; indeed, it can 
be argued that the fact that it is easier 
to reverse expectancies in order to 
demonstrate preferences than to reverse 


references in order to determine ex- 
pectancies is a principal reason why 
there are many more experiments on 
the analysis of preferences than on the 
analysis of expectancies. Usually, ex- 


ectancies are modified by giving the 
organism experience with a changed 
external situation, and preferences are 
varied by operations related to the bi- 
ological state of the organism. Al- 
though the latter is often the harder 
task, increased knowledge, especially in 
physiological psychology. is already 
reducing the difficulty. 

In many cases, of course, the re- 
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versal of preferences can be achieved 
by establishing contingent preferences 
between the outcomes under study and 
some pair of differential outcomes 
whose relative preference values for 
the organism are known and suffi- 
ciently definite. Nothing is more com- 
mon in decision-making experiments 
than just this. It may be a matter of 
indifference to S whether a coin toss 
emerges as "heads" or "tails" or a 
drawn card is marked or blank or a 
key press produces a buzz or silence 
until he learns that money gains and 
losses are associated with these events. 
The criteria of expectancies given here 
require that S have preferences be- 
tween the outcomes, but they do not re- 
quire that these preferences be in some 
way intrinsic to the outcomes. No one 
(or, at any rate, no cognitively dis- 
posed psychologist) can doubt that 
Wolfe's (1936) chimpanzees formed 
various expectancies in which poker 
chips were the expected outcomes, but 
it is equally clear that the values of 
these tokens were formed by differen- 
tial association with food, water, play, 
and other outcomes whose own values 


had a history that could only be sur- 
mised, 


Strength of Expectancies 


It is assumed that expect: 
in their relative strength, but no for- 
mal treatment other than that of the 
third postulate stated above will be at- 
tempted here. It is obvious that to 
know that an expectancy exists is not 
to know its strength. For example, a 
child’s expectancy that Opening a blue 
box rather than a yellow one will gain 
him a toy may be weak, medium, or 
strong, and, in particular, to speak of 
the toy as the expectandum does not 
imply that the child sees the occurrence 
of this outcome as more likely than its 
nonoccurrence. Experiments could he 
arranged in which, although the prob- 


ancies vary 
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-ability of gaining a toy by opening the 
blue box was always greater than that 
of gaining the same toy by opening the 
yellow box, ‘both probabilities were 
either higher or lower than .5, and the 
relative strength of the expectancies 
would presumably vary with these 
probabilities. 


Expectancy Versus Drive Discrimina- 
tion 


If the rat of Illustration 2 is thought 
of as being switched between the two 
states, “hungry but not thirsty” and 
“thirsty but not hungry,” and Out- 
comes o and o' are, respectively, food 
and water, then Table 1 is the design 
of certain "drive-discrimination" ex- 
periments, such as Leeper's (1935) 
main experiment and Kendler's (1946) 
test trials. In these instances (except 
for a few correction trials in Leeper's 
case), if the animal turned one way, 
it always found food and if it turned 
the other way, it always found water. 
This is a design for diagnosing expect- 
ancies, not discrimination; hence, the 
experiments are to be interpreted as 
demonstrating the existence of such 
expectancies as, say, that turning right 
rather than turning left will lead to 
food, and that turning left rather than 
turning right will lead to water. 

Among other drive-discrimination 
experiments, those that used a cor- 
rection method require special analysis 
and will not be classified here, These 
include Hull’s (1933) original study 
of the problem and such recent experi- 
ments as those of Levine (1953), 
Bolles and Petrinovich (1954), and 
Deutsch (1959). A clear-cut discrimi- 
nation design was, however, used by 
Jenkins and Hanratty (1949), When 
their rats turned right, they found food 
if they were less hungry and an empty 
box if they were more hungry di 
versus 473 hours of food deprivation) ; 
When they turned left, they found fooc 
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if they were more hungry and an 
pomy box if they were less hungry. 

he outcomes of the acts were contin- 
gent upon the states of hunger that 
were used as discriminanda, thus ful- 
filling the fundamental criteria of a 
test of discrimination (Irwin, 1958). 
In general, tests of an organism’s dis- 
crimination between two preference 
states such as oPo' and o'Po require 
the paradigm of Table 2. The pattern 
of plus and minus signs in this table 
represents positive results of the test. 
-The use of Outcomes 1 and n instead 
‘of o and o' in the cells of the table 
calls attention to the fact that tests of 
discrimination between two preference 
states can be made without employing 
outcomes relevant to those states. 
'There must, of course, be a preference 
relation between 7! and n; in the use 
of the table, it is assumed that a Pn. 
So, for example, to test for discrimina- 
tion between the preferences for food 
over water and for water over food, one 
could let m be the turning © of an 
aversive bright light by pressing the 
correct one of two panels, and n be the 
null differential outcome of no change 
in illumination from pressing the 1n- 
correct panel. Had Levine (1953) 
and Bailey (1955) not used a correc- 
tion method, this description would 
hold for their experiments. 


CowPARISONS WITH OTHER 
ForMALIZATIONS 


Tolman, Ritchie, and Kalish 
Tolman etal. (1946, 


DF. IL If is deprived of food and x has 
is now put on 


path P and path P is nov 
are other paths which lead 2 A 
P, one of which points directly to location 
L, then (x runs down the path W d at 
directly to location Les expects food a 
location 
The present author's treatment of 
this situation would require several 
suppositions whose acceptability to Tol- 


TABLE 2 


PARADIGM FOR DISCRIMINATION BETWEEN 
THE STATES  oPo' AND o'Po 


Conditions 

Alternative = n Ses 
acts 1 8 " E 
oPo' o'Po | oPo' o'Po 
aT oe 

a | | 
(m) (n) | (n) | (m) 
2-191449 

a’ | | 
(n) (m) (m) | (n) 


: differential outcomes with m preferred 
o: opposed preference states. Plus and 
icate association between acts and pref- 


man et al. may be questionable. First, 
it would be supposed that ** is de- 
prived of food" empirically implies that 
the animal desires food, that is, fP¢.° 
It would further be supposed that the 
statement, “x runs down the path which 
points directly to location L," means 
that the animal chooses this act as 
against some other act, such as running 
down another path or not running at 
all. Finally, it would be supposed that, 
if food (f) could be made aversive to 
the animal, that is, if @Pf, then the 
animal would perform the alternative 
act of not running or of choosing a 
different path. Given all of this, the 
results would fit Table 1, and the pres- 
ent author would accept the conclusion 
that the animal expected the act of 
running down the L-pointing path 
rather than committing the alternative 
act to lead to food. and also that the 
animal expected that committing the 


alternative act rather than running 
down the L-pointing path would lead 
to $. 


Two additional comments are neces- 
sary. The first has to do with the bi- 
conditional in the last clause of the 


Tolman et al. definition. Even if posi- 
tive results in accordance with this 


2 See Footnote 1. 
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definition permit the diagnosis of n 
expectancy, it is surely too iong o 
maintain that the converse 1s true, tha 
is, that the existence of the expectancy 
necessarily leads to the appropriate 
choice behavior. Presumably Tolman 
et al. themselves would agree that the 
definition does not provide a statement 
of the sufficient conditions for the be- 
havior. To do so would be to give a 
complete psychological theory, which 
the “definition” obviously does not do. 
The present author's defining criteria 
are subject to the same limitation. It 
is perfectly possible for an organism to 
have the required preference states and 
expectancies and yet, when introduced 
into the experimental situation, not 
make any choice at all, and the prefer- 
ences and expectancies may be present 
when the organism is outside the ex- 
perimental situation and therefore in- 
capable of expressing them. Tf the 
behavior does not occur, there is no 
evidence, not negative evidence. 
Second, the expectancy of Tolman 
et al. is defined under the condition 
that the animal has had a particular 
sort of training and has learned the ex- 
pectancy during this training by a proc- 
ess of inference. But these are verv 
restrictive conditions. On the one 
hand, it is more than possible that such 
expectancies might be formed by or- 
ganisms that were incapable of draw- 
ing the inferences required by Tolman 
et al.; on the other, even granted such 
capability, the generality of such a con- 
ditional definition is far less than is 
desirable for a concept so fundamental 
to a cognitive view. 


MacCorquodale and Meehl 


In what is probably the most 
thorough attempt to formalize the no- 
tion of expectancy. MacCorquodale and 
Meehl (1953) set forth “preliminary 
suggestions" of a postulate system simi- 
lar to Hull's but differing from it criti- 
cally by substitution of "strength of ex- 
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pectancy," (S,R,S.), for Hull's “habit 
strength," sHr. The postulates pro- 
vide for the growth of strength of ex- 
pectancy by frequency of occurrence of 
the sequence S$, > R, — S, when these 
elements are in close temporal con- 
tiguity, for weakening ("extinction") 
of the expectancy by occurrence of the 
sequence $,— R, without its being 
followed by S, and for the develop- 
ment (by "inference") of an expect- 
ancy (S,R,S*) if the sequence S,S* 
occurs when (S,R,S,) already exists. 
The system is non-Hullian in that 
strengthening and weakening of ex- 
pectancies by confirmation and discon- 
firmation take the place of growth of 
habits by reinforcement and extinction 
of responses by inhibition. 

In comparing the MacCorquodale- 
Meehl expectancies with those de- 
scribed here, it must be kept in mind 
that the former are defined implicitly 
by their roles in a postulate system. 
whereas the latter are given explicit 
and quasi-independent definition by 
means of a canonical form of evidence. 
One consequence of this is that, while 
(S,R,S,) is defined in part by postu- 
lates that deal with its acquisition. 
aa'Eo is defined without reference to 
this or any other "antecedent" condi- 
tion. If this were the only major dif- 
ference, it might be left for argument 
at the level of philosophy of science. 
There are, however, two other differ- 
ences that appear to bear more directly 
upon the psychological nature of the 
processes under discussion: (a) the 
presence of the S, terms in the Mac- 
Corquodale-Meehl (S,R,S.) and the 
absence of corresponding terms from 
aa'Eo, and (b) the reference to an al- 
ternative act in aa'Eo and the absence 
of this from (S,R,S,). 

The first of these two differences. 
(a), is more apparent than real. It 
seems to do no violence to MacCor- 
quodale and Meehl’s (S,R,S,) to de- 
scribe it as the expectancy that R, will 
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lead to S,, given that S, is originally 
present. Perhaps something is gained 
by the explicitness of this reference to 
the , conditions under which the or- 
ganism expects R, to result in Sa, but 
the present author's preference here, as 
also in connection with the problem of 
the relation between the definition of 
expectancies and their laws of acquisi- 
tion, is to leave the matter for experi- 
ment to decide, rather than to put tight 
limitations on it at the level of postu- 
late and definition. The present author 
supposes that the conditions under 
which such an expectancy as aa Eo will 
actually exist are among the very 
matters of fact that the psychologist 
should determine by experiment, Note 
that if the above translation of (S,R,S2) 
into words is accepted. then this is the 
same expectancy as (SpR,S2). where 
S, is any arbitrarily chosen stimulus— 
the difference is merely that the ex- 
pectancy R,S, is contingent upon S, in 
the first case and upon Sy in the second. 
This does not seem to be incompatible 
in any way with MacCorquodale and 
Meehl's system, nor are these two €X- 
Pectancies incompatible with each 
other. 

As far as (b) is concerned. the fact 
that aa’Eo contains a reference to @ 
Pair of alternative, mutually exclusive 
acts, whereas (S,R,S2) does not, Te- 
flects at least a difference in the starting 
points of the schemes of which they 
form parts. The MacCorquodale- 
Meehl system shows its S-R paternity 
in that its variables are directed towar 
accounting for the quantitative proper- 
ties of particular "responses" bn 
final intermediary variable in the sys- 
tem is symbolized “Ep” and is spoken 
of as “the reaction potential — OF 8 
response R, in the presence OF 2i `- E 
[p. 58]. Neither explicit nor im- 
Dlicit reference to choice by the or- 
ganism is found in the postulates except 
for the statement: “The oscillations o 
two different „Ep’s are treated as ™- 
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dependent, and the response which is 
momentarily ‘ahead’ is assumed to be 
emitted [p. 58].” On the other hand, 
the present paper sets out from a choice 
situation and makes the determination 
of one expectancy, aa'Eo, interdepend- 
ent with the determination of another, 
a'aEo'. 

Similarities and contrasts between 
the two conceptions can perhaps be 
seen best by asking whether there is an 
equivalence between the assertions 
*aa' Eo" and "(S,RS)/(S,R'S.) > 1” 
(where R and R' are mutually exclu- 
sive responses), thus taking one of our 
expectancies to be analogous to the ratio 
of two of MacCorquodale and Meehl’s 
expectancies. For the equivalence to 
hold, MacCorquodale and Meehl would 
have to maintain that *(S,RS/ 
(S,R'S,) > I" is materially equivalent 
to (3S) [CR S5) NSRS > 1)" 
in order to preserve the analogy to the 
material equivalence between "aa'Eo" 
and * (30')a'a Eo'." Such a claim 
seems inappropriate to their system 
and would at the very least require 
elimination of their assumption of 
independence among various sEx’s. 
This difference between the two sets 
of ideas is a direct and instructive con- 
of the fact already mentioned. 
namely, that the one intends to study 

esponses and the other in- 


individual r 
tends to study phenomenon of choice. 


One further point must be made. 
MacCorquodale and Meehl mention the 
difficulty experienced by Kendler 
(1946) in accounting for his own re- 
sults without, to use their phrase. 
“smuggling in" the roso mechanism. 
They attribute their own ease in han- 
dling the same results to the fact that 


their 


.. . activation postulate makes reference to 
the expectandum. » -> The expectancy frame 
retains the reference to the expectandum in 
the basic cognitive element (S:RS2) and thus 
this [expectandum] can be 


a reference to tuS i : 
packed into the activation postulate in a way 


sequence 
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ent drive manipulations 
t allows subsequen 
aee control over the response strength 


[1953, pp. 59-60]. 


It seems that MacCorquodale and 
Meehl misplace their emphasis in these 
statements. It is important for “cog- 
nitive elements" of a theory of be- 
havior to refer to outcomes of acts, but 
such reference is equally necessary in 
the motivational terms of the theory. 
MacCorquodale and Meehl provide ex- 
actly this in their activation postulate, 
the first sentence of which reads: “The 
reaction potential sEp of a response R, 
in the presence of S, isa multiplicative 
function of the strength of the ex- 
pectancy (S,R,S,) and the valence 

(retaining sign!) of the expectandum 

[p. 58].” Surely it is the reference 

to the outcome in "valence of the ex- 
pectandum" rather than in the ex- 

pectancy that is critical for the superi- 
ority of their system over S-R Systems 
that employ, smuggled in or not, the 
77-5, mechanism (cf. Deutsch, 1956), 

Spence (1956) maintains that Sy con- 
Stitutes just such a reference to an out- 
come, and Osgood (1953, pp. 391-412) 
takes a similar attitude; but Ty-$, Te- 
sembles an expectancy rather than a 
preference. Such problems do not 
arise in the present System, since both 
the motivational and the cognitive 
processes that are involved in a choice 
refer to the same outcome. The chosen 
act is dependent upon a preference, 
oPo', and an expectancy, aa'Eo, that 
are interlocked through the identity of 
0, Which is simultaneously the preferred 
outcome and the expectandum. 
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NATURE OF THE EFFECT OF SET ON 
PERCEPTION * 


RALPH NORMAN HABER 


University of Rochester 


To explain the effects of set on reports of 


potheses are elaborated—set enhances the pe 
al organization o 
the set. Extensive research is reviewed 


e hypotheses, especially studies that at- 


tates responses and memori 
that itself was uneffected by 
that is relevant to each of thes 
tempt to differentiate them. 


_ When a perceiver is given instruc- 
tion to attend to only one of several 
attributes of a stimulus, he is able to 
Teport his perception of that attribute 
More accurately than he would be able 
to otherwise,  Külpe, in 1904, first 
documented this effect experimentally, 
and, since then, a number of investi- 
gators have replicated the basic finding. 
The operations have typically involved 
either instructing the subject (S) as 
to which attribute he should attend or 
providing S with a set of alternatives, 
from which the stimulus has been 
drawn. However, the same process is 
implied by implicit instructions ema- 
nating from 5 himself, such as WOW 


occur when a hungry man is looking 


for food. The experimental demo? 
Stration requires that the set cannot 
jechanisms 


be satisfied by peripheral n 
Such as eye movements Or 5 
Bes stimulus to locate the re 
tibutes, since, otherwise, the phe- 
nomena would not be as interesting. 
This paper is not a general review 
the effects of set on perception, 
though such a review is badly needed 


canning O' 
levant at- 


of 
al- 


1 Partial suppor r this research Was 
Provided by a Dod the United States 
able Health Service ( 03244) to ie 
hahar at Yale University, Where this wor 
‘ed been completed. I am most appreciatie 
i5 "Thomas Natsoulas and Maurice Hershen" 
Son for their many valuable comments on the 
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perceptual experience 2 hy- 
rcept directly, or set facili- 
f the perceptual experience 


to bring Gibson’s (1941) discussion up 
to date. Rather, the topic here is an 
examination of the two basic hypothe- 
ses that have been used to interpret 
set effects. As these issues were not 
so clearly developed at the time of 
Gibson’s review, little reference can be 
made to his paper- 

At least two basic and dissimilar in- 
terpretations have been suggested to 
explain the effects of set on perception. 
The older one, favored by Külpe and 
most of the investigators following him, 
including the “New Look" theorists in 
perception, appeals to a perceptual en- 
hancement or "tuning" hypothesis 
(Dember, 1960; Postman, 1963), 
whereby attending to à particular at- 
tribute of a stimulus results in a clearer 
and more vivid perception of that at- 
tribute—it stands out more. By the 
same token, the incidental attributes 
are not as clear and do not stand out. 
Thus, the perceptual tuning hypothesis 
places the locus of the effect of set in 
the perceptual system itself, occurring 
while the stimulus is being viewed. 

The alternative hypothesis is that 
set has no effect on perception itself, 
but only on some aspects of the memory 
trace or on responses to that perceptual 
experience. Several varieties of this 
“response” hypothests have been pro- 
posed : (a) The set facilitates relevant 
responses by S, increasing the prob- 
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ability that S can. identify the ae 
in his report; (b) the set causes & ? 
report the emphasized see ga et 
before memory of the stimulus fades, 
thus allowing those attributes to be re- 
ported more accurately; (c) the set 
modifies the organization of the mem- 
ory trace, so that the important attri- 
butes are remembered more accurately. 
The two basic hypotheses have wider 
application in perception and memory 
than just to problems of set. Dehind 
them lies specification of one of the 
most difficult distinctions in any analy- 
sis of perceptual behavior. The tend- 
ency has been to assume that just be- 
cause S was reporting immediately 
what he had perceived he was therefore 
performing a "perceptual" task. There 
is a crucial difference between report- 
ing one's experience and reporting the 
attributes of stimuli that one remem- 
bers seeing. One task is perceptual, 
while the other deals with a memorial 
process. These must be conceptually 
differentiated, even if they may func- 
tion similarly, and even if they pro- 
ceed originally from the same sensory 
base. There are innumerable experi- 
ments that fail to make this distinction, 
and much of the controversy among 
perceptual theories results directly 
from this confusion. The two hypothe- 
ses under discussion in this paper re- 
flect the distinction being proposed. 
To provide evidence relating to either 
or both of these requires that opera- 
tional definitions be made for meas- 
urement of perceptual experience di- 
rectly, or that deductions from these 
hypotheses specify operations in terms 
of memory alone that will still permit 
inferences concerning perceptual ex- 
perience. As the present discussion 
will amply illustrate, neither of these 
conditions is well met in practice, 
though abundant data are available 
regarding both of these hypotheses. 
Needless to say, the carliest experi- 
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ments (e.g., Külpe, 1904; Rubin, 1913; 
Wilcocks, 1925; Yokahama, reported 
in Boring, 1924) were not designed to 
separate these two basis hypotheses, 
even though the hypotheses were dis- 
cussed as alternative explanations for 
the various results. 

Külpe's (1904) stimuli were groups 
of letters varying in color and location. 
Prior to most presentations, S was 
given a set to note only one of the 
several attributes of the stimulus: the 
number, position, or color of the let- 
ters, or the overall figure formed by the 
letters. For the remaining stimuli, no 
specific set was given. After the 
presentation, S was instructed to re- 
port the attributes as best he could, 
with no order specified. If he left 
any out, the experimenter (/) ques- 
tioned him on those, Külpe found that 
S was more accurate in reporting the 
attribute for which he had paid at- 
tention prior to the presentation and 
was less accurate in reporting the 
others. Külpe suggested that the at- 
tention instructions facilitated the clar- 
ity of the percept. 

Wilcocks (1925), responding to a 
criticism of Kiilpe by Rubin (1913). 
extended Külpe's method to include a 
memory variable. Wilcocks argued 
that if the reports of attributes after 
the presentation were in an uncon- 
trolled order, it is possible that S might 
have forgotten some of them by the 
time he was questioned. Therefore, 
Wilcocks randomly arranged the order 
of reporting the attributes, though, in 
other respects, his experiment was 
similar to Külpe's, He found in gen- 
eral that accuracy was poorer the later 
an attribute was reported. Even so. 
he found that when order of report was 
controlled, the report of the attended-to 
attribute was more accurate than that 
of the unattended-to ones. Thus, while 
a memory variable was shown to be 
operative, it seemed to he independent 
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of the enhancement effect. Therefore, 
Wilcocks concluded that Külpe's basic 
finding was replicated and could not be 
reduced to a memory process. Like- 
wise, he accepted Külpe's interpreta- 
tion of this process as perceptual en- 
hancement. 

Chapman (1932) used similar fig- 
ures, where S could either count the 
letters, locate their positions, or name 
them. Instructions to attend to only 
one of these attributes were given in 
some sessions 4 seconds prior to the 
stimulus exposure, and in the remain- 
ing sessions 2 seconds following the 
exposure, in both cases by means of an 
illuminated display. Chapman's re- 
sults showed that accuracy of report 
Was higher when the attention instruc- 
tion was given before the stimulus ex- 
Dosure, as compared to afterwards. 
Comments made by Ss indicated that 
the attended-to attribute was perceived 
more clearly during the actual exposure 
of the stimulus. They felt they did 
Worse in the aftercondition because 
they had to keep too many things in 
mind, However, the comments also 
Suggested that Ss’ memory of the 
stimulus was affected by the type of set 
Siven, In the prior condition, they 1n- 
dicated that the memory for the ir- 
relevant attributes faded and became 
indistinct. Thus, Chapman conclu led 
that attention instructions given prior 
to the exposure both enhanced the per- 
ception of the stimulus itself during its 
Actual exposure and, further, started 
in motion a differential forgetting 
Process, where the relevant attribute 
Was maintained in memory more accu" 
rately and for a longer time. 

The evidence regarding the effect of 
Set reviewed to this point supports a 
Perceptual tuning hypothesis. primar” 
ily on the hasis of Ss’ postexperimental 
introspective comments. They report 
that the set-for attributes aT€ clearer. 
Stand out more, fade less rapidly, and 
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the like. Support for at least one of 
the response hypotheses is also found, 
based on before-after differences, or- 
der of report effects, as well as intro- 
spective comments regarding fading 
memory. In light of the importance of 
an S’s report of his perceptual experi- 
ence, the comments regarding experi- 
ence in these experiments must be 
given weight, even though they were 
not collected in any systematic or 
complete fashion nor subjected to any 
content analysis, and even though S 
was given no pretraining on the cate- 
gories to use in his report. (See Post- 
man, 1963, for a discussion of these 
criteria to be used for the evaluation of 
introspective reports in perceptual re- 
search.) 

While an S in this type of experi- 
ment might be confident that he noticed 
changes in the perceptual character- 
istics of the stimulus as his set changed, 
no converging operations (Garner, 
Hake, & Eriksen, 1956) sufficient to 
demonstrate the validity of those re- 

orts were employed. Introspectively, 
it might be very difficult for an S to 
differentiate a rapidly fading memory, 
for example, from an image that was 
perceptually less clear. Unless experi- 
ments are designed to guarantee that 
S can make this distinction introspec- 
tively (and the ones just reviewed 
clearly do not offer such a guarantee). 
they cannot provide unequivocal evi- 
dence regarding either of _ these two 
hypotheses. Therefore, this evidence 
can be considered only suggestive sup- 
port for the perceptual tuning hypothe- 
ses, as well as providing some evidence 
for at least one of the versions of the 
response hypothesis. . 

The New Look in perception, be- 
ginning after World War IT, accepted 
this evidence, however. as offering 
strong support for the perceptual tun- 
ing hypothesis (Bruner, 1957a, 19571. 
A number of experiments were then 


338 


rovide further 
reported Lge Capel a iacluding 
support Nel er (1954), Postman and 
ones uy Duns ner and Postman 
Bruner (1949), Bru 
1949), Krulee, Podell, and Ronco 
( and Anderson (1956), 
(1954), Green illi 1956 
Ross, Yarczower, and Williams (1956), 
Hoisington and Spencer (1958), to 
name only a few. For example, Post- 
man and Bruner (1949) showed that if 
S was given a multiple set of alterna- 
tives (e.g., one of two stimulus words 
will be either a food or a color word— 
report only that one), he had higher 
recognition thresholds than if a single 
set was given (e.g., one of two stimulus 
words will be a food word—report only 
that one). Thus, they argued that 
since the recognition thresholds were 
manipulated by the set, the set had 
actually changed the perceptual char- 
acteristics of the stimulus. 

Neisser (1954) presented Ss a list 
of words to study, indicating that some 
of them would later be shown tach- 
istoscopically. He did in fact show 
some of the original words, but, in 
addition, exposed some that were hom-. 
onyms of other words on the list. He 
found no facilitation of the homonyms, 
even though the original words were 
reported more rapidly. Since a word 
and its homonym require the same re- 
sponse, but only the original word was 
facilitated, he concluded that the pre- 
sentation of the actual word prior to 
exposure must have made the word 
itself stand out perceptually, 

Bruner and Postman (1949) gave 
Ss the set to expect to see normal 
playing cards, but in fact showed them 
cards in which the colors and suits did 
not agree (e.g., black hearts), While 
many Ss apparently ignored the in- 
congruity and reported the cards as 
they would normally appear (e.g., red 
hearts or black clubs or spades), a few 
Ss reported seeing “compromise” per- 
ceptions (e.g., brown or purple hearts). 
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These latter responses have been in- 
terpreted as supporting a perceptual 
hypothesis, since no response facili- 
tation could be expected for a category 
such as purple hearts that S had never 
seen before. 

Bruner (1957b) has offered an in- 
triguing suggestion regarding the re- 
lationship between the two hypotheses 
under discussion. He proposes that 
perceptual experience (the concern of 
the first hypothesis) is the result or 
end product of categorization (the con- 
cern of the second hypothesis), Thus, 
an object is experienced or perceived 
only after it has been properly classi- 
fied—that is, responded to. Prior to 
its identification, the perceptual experi- 
ence of a stimulus is blurred and indis- 
tinct. In the present context, then, 
if the perceiver has a set which facili- 
tates classification of the stimulus, it 
also then facilitates the clarity of its 
percept. 

Bruner and Potter (1964) present 
some evidence relevant to one aspect 
of this notion. They show 5 a common. 
everyday object extremely out of focus 
and then gradually increase the focus 
until it is perfectly clear. During the 
period of increasing focus, S is required 
to give a running commentary of what 
he thinks the picture contains, from 
which the point of correct identification 
can be determined. They found that if 
the focus increases very slowly, so 
that S has extensive viewing during 
which he cannot correctly classify the 
stimulus, he requires a greater amount 
of focus before correct identification oc- 
curs. From this the authors argued 
that the extent to which S misclassifies 
the picture during the very fuzzy pre- 
sentations prolongs the time before he 
can see it clearly, Presumably, during 
this time, S is testing his classifications 
against the fuzzy image, and he has to 
reject the wrong classifications he made 
before he can concentrate on testing 
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correct ones, Only after he tries and 
checks the correct one does he come to 
see the picture clearly. 

_ Obviously, this kind of experiment, 
interesting as it is, cannot provide 
much evidence for as complex a posi- 
tion as Bruner is espousing here. He 
has to make too many additional as- 
sumptions to have a clear-cut analysis. 
À somewhat more relevant, though still 
indirect, test of this notion was re- 
ported by  Hershenson and Haber 
(1965). "These authors had previously 
found that the clarity of letters of words 
increased markedly if they were flashed 
several times, even when on the first 
flash S could see none or few of the 
letters (Haber, 1965a; Haber & Hersh- 
enson, 1965). However, the rate of in- 
crease in clarity for Turkish words was 
slightly faster than that for English 
words. It was assumed that 5 would 
be less likely to misclassify what he 
knew was a Turkish word from only 
partial information, since he knew it 
would be a totally unfamiliar word. 
Therefore, this result was interpreted 
to imply that when S made incorrect 
anticipatory classificatory responses for 
the English words, this slowed down 
the development of their clarity. 

Even given these kinds of data, 
Bruner's notion faces serious theoreti- 
cal problems on other counts, as, for 
example, how one can sec something 
that is novel when he cannot classify 
it correctly. Therefore, until more 
than this meager and indirect evidence 
is available, little else can be said for 
this version of a linkage between the 


two processes. i 

An indepen a 
originally led by 4 51), 
has provided the basis of the Transac- 
tionalist theory of perception. In this 
context, many demonstrations were 
Created to show how an S s assump- 
tions based on past experience oF 
knowledge directly affect his percep- 


tual experience. In one of the most 
striking of these, a physically rotating 
trapezoidal-shaped window appears to 
oscillate back and forth, and to be 
rectangular in shape. This is a very 
powerful perceptual experience, ex- 
perienced by nearly every viewer, so 
that it is difficult to explain in terms 
of any kind of response process. Haber 
(1965b) has recently shown that if Ss 
are given extensive experience with 
the apparatus and knowledge of the 
theory behind the illusion, their per- 
ceptual experience changes so the ma- 
jority of Ss report seeing the nonillu- 
sory rotation of the trapezoidal figure. 
Numerous other examples of these 
demonstrations can also be used to sup- 
port the effects of S’s assumptions 
(set) on his perceptual experience di- 
rectly, in ways suggesting that response 
processes are inoperative. 

Shortly after the beginnings of the 
New Look, strong interest was revived 
in alternative interpretations involving 
response and memory processes. Many 
of the New Look experiments were 
scrutinized for evidence of these proc- 
esses or for faults in design. Accom- 
panying this methodological concern 
was the new development in theorizing 
about methods of research in percep- 


The most important single paper 


tion. 
al. 


in this regard is by Garner et 
(1956). While not taking sides with 
respect to these two hypotheses, Gar- 
ner et al. point out quite persuasively 
the inadequacy of nearly all previous 
research to decide among the interpre- 
tations of the effects of set, as well as 
motivation and implicit instructions, on 
perception. They suggest several non- 
perceptual interpretations of these pat- 
terns of results and present the concept 
of converging operations as an experi- 
mental design to provide tests between 
the two hypotheses. They are particu- 
larly concerned with the possibility 
that the set limits the number of al- 
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ternatives from which S — i 
his response, thus increasing the Lp 
ability that the correct response wi " 
made, independent of any perceptua 
enhancement. Eriksen (1958, 1960, 
1962) has continued this deae : 
arguing that nearly all research in this 
area has failed to differentiate the two 
hypotheses satisfactorily, and that since 
from his point of view the response 
interpretations are more parsimonious, 
there is no evidence to support a per- 
ceptual interpretation. His position 
has received added support from a 
number of recent studies. 

Lawrence and Coles (1954) argued, 
along with Chapman (1932), that a 
perceptual tuning hypothesis has to 
predict that a set given before the 
stimulus, as compared to one given 
afterwards, will provide greater facili- 
tation of report. They go further and 
suggest a dual hypothesis : Perhaps per- 
ceptual enhancement does occur if a 
prior set is given, but, when given 
afterwards, facilitation could be due 
to some selectivity in the memory proc- 
ess. If this were true, then an experi- 
ment like Chapman's would not dis- 
tinguish between the purely perceptual 
and the dual hypothesis, since they 
might both be correct. i 

Lawrence and Coles ( 1954) used 
pictures of single familiar objects, with 
the set administered by projecting on 
the screen, either immediately before 
or after the stimulus exposure, the 
names of four alternatives for the stim- 
ulus. The alternatives were very simi- 
lar to one another for some trials, while 
on others they were quite discrete, One 
group of Ss had no alternatives at all; 
this served as a control on the use of the 
alternatives. 

The results showed that giving al- 
ternatives of either type increased the 
accuracy of recognition of the stimuli, 
as compared with the no-alternative 
control group. Hence, the effect of set 
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was demonstrated. However, alterna- 
tives given after the exposure were 
just as useful as those given before 
(which stands in contrast to Chapman’s 
before-after difference). 

This failure to replicate Chapman’s 
before-after difference was not con- 
sidered crucial, since the equality may 
have been due to perceptual enhance- 
ment from the before set and memory 
sclection from the after set—the two 
processes just balancing each other. 
However, they also failed to find any 
interaction between the before-after 
sets and the type of alternatives (simi- 
lar or discrete). They had argued that 
a perceptual hypothesis would predict 
no difference between the two types of 
alternatives when they are given before 
the presentation, since the correct al- 
ternative would stand out whether 
other alternatives were similar or not. 
On the other hand, since the aíter set 
could not affect enhancement, it must 
affect memory, which should show a 
differential effect of the two types of 
alternatives— discrete ones being more 
effective in facilitating correct recogni- 
tion than similar ones. Failure to find 
evidence in support of this prediction 
(discrete alternatives consistently fa- 
cilitated recognition more than similar 
ones for both before and after sets) led 
Lawrence and Coles (1954) to reject 
any form of the perceptual tuning hy- 
pothesis, and to argue that the effect of 
set was via some sclective facilitation 
of memory processes. 

By implication, Lawrence and Coles 
provided a possible explanation for 
Chapman's before-after difference —the 
one they failed to replicate. Under the 
assumption that the entire effect of set 
is caused by memory and response 
processes, the earlier the set is given. 
the greater the opportunity it will have 
to provide facilitation of these proc- 
esses. Thus, even if there is no per- 
ceptual tuning, administering the set 
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prior to the exposures gives S just 
that much more time to organize his 
memory. Sets given after the stimu- 
lus presentation have to act on an al- 
ready fading memory trace and are 
at a disadvantage compared to a set 
given before the trace has weakened. 
As the delay between the exposure and 
the set increases, the superiority of the 
prior alternatives should increase. 
Since Chapman had a longer delay 
than Lawrence and Coles, this slight 
time difference may have accounted for 
the different outcome in the two ex- 
periments. 

An interrelated group of studies by 
Long, Reid, and Henneman (1960), 
Long, Henneman, and Garvey (1960), 
and Reid, Henneman, and Long (1960) 
has used a model of set as a response 
limiter, similar to Garner et al.'s 
(1956) notion, whereby set "increases 
perceptual accuracy by increasing the 
probabilities of certain responses and 
decreases the probabilities of others in 
the various response classes [Long, 
Reid, & Henneman, 1960, p. 554]. 
Most of their experiments used single 
letters as stimuli, degraded in clarity, 
with set manipulated by giving $a 
group of alternative letters. of which 
the stimulus letter would be one. The 
alternatives were given either before 
or after the exposure. They found 
that decreasing the number of alterna- 
lives increased the accuracy, regardless 
of whether the alternatives Were given 
before or after the exposure. From 
this they concluded, as did Lawrence 
and Coles (1954), that set was acting 
on the selection of responses, and ust 
On perception itself. Greater accuracy 
was also found if the alternatives were 
drawn from an exclusive subset rather 
than from the entire alphabet. ar 
ther, they showed that set effects cou 
be produced by restriction of the cues 
Eories (rather than alternatives) into 
Which the stimulus might fall. This is 


somewhat similar to the Postman and 
Bruner (1949) finding, but was found 
to be true regardless of whether thé 
restriction was given before or after 
the stimulus. This greatly weakens 
Postman and Bruner's contention that 
restriction of category affects the per- 
ception of the stimulus itself. All of 
these findings are consistent with a 
response-limiting interpretation of set. 
However, Long, Reid, and Henne- 
man (1960) also found and replicated 
several before-after differences. One 
was found when only two alternatives 
were provided, so that being told the 
stimulus was one of two particular let- 
ters before the flash helped S report 
the letter more accurately than if such 
a set was given 1 or 2 seconds after the 
flash. Another before-after difference 
d in an auditory version of their 
experiments, when four alternatives 
were presented either before or after 
the stimulus letter was spoken in noise. 
Finally, when two degraded letters 
were presented and S was asked which 
of the two was a specific alternative, 
they found that specifying the search 
letter before the presentation facili- 
tated correct discrimination, as com- 
pared to a later specification. - . 
Each of these findings is inconsis- 
ith the response interpretations 
of set, and it is not clear how they 
might be explained other than by some 
form of perceptual enhancement. The 
Lawrence and Coles’ explanation—that 
the earlier the set is given, the greater 
its opportunity to facilitate memory 
rocesses—will not work, since it 
should follow that the greater the num- 
ber of alternatives, the greater the ef- 
fect of an earlier set. "Increasing the 
alternatives. implies that the number 
of discriminating characteristics be- 
tween them become more restrictive, 
and thus are more likely to be lost in 
the trace [Lawrence & Coles, 1954, p. 
213].” However, these findings, in- 


occurre 
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š B i reta- 
consistent ii a eh n iw al 
tion, occurred primarily ifi 
ternatives, and no before-after di = 
ences were found with larger number o 
ice line of research, Taub 
(1965), following the work of Teich- 
ner, Reilly, and Sadler (1961) and 
Teichner and Sadler (1962), pre- 
sented an array of letters to S, asking 
him to report all of them. The number 
of letters in the array, the exposure du- 
ration, and the relative value of each of 
the letters (set) were varied. Value 
was manipulated by assigning points 
for correct reports of letters, with dif- 
ferential value attached to letters from 
the first and second half of the alpha- 
bet. Taub found that value affected ac- 
curacy (a set effect), but primarily 
through a reduction in accuracy of the 
low-valued letters, rather than an in- 
crease in the high-valued ones. This 
occurred in spite of the finding that the 
first letter reported from an array was 
most likely to be one of high value. 
Taub’s (1965) experiment is most 
relevant to the issues of this discussion 
hecause of the similarity of a prediction 
he makes to the Lawrence and Coles’ 


(1954) similar-discrete difference pre- 
sented earlier: 


. if a major selective process in perception 
was involved, then the size of the difference 
between high- and low-value letters should 
vary inversely with length of exposure. 
That is, a perceptual set, which would make 
the higher value categories easier and quicker 
to identify (Bruner, 1957), would occur at 
those conditions where there is little time 
to detect and discriminate between the cate- 
gories, but have little effect at longer 
exposures where all letters would be equally 
discriminable. Predictions from a selective 
recall point of view would be the opposite, 
That is, as exposure time increases and the 
rate of memory loading decreases, Ss would 
have more time to selectively code the 
categories which are detected. Thus, a 
selective memory process explanation would 
predict increasing differences in response to 
value as exposure time increases [p. 142]. 
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Since Taub found unequivocal evi- 
dence that increasing exposure time 
increased the differences in accuracy 
between high- and low-value words, he 
rejects a perceptual tuning explanation 
in favor of some variety of the memory 
hypothesis—especially a selective-re- 
call one. However, Taub used rela- 
tively long exposure durations (500- 
2,500 milliseconds), so that visual 
searching clearly occurred for the 
longer presentations. Therefore, his 
data are equivocal with respect to this 
issue, since S has opportunity to seek 
out the high-value items if given suf- 
ficient time, and this would imply noth- 
ing about the effects on perception at 
all. Thus, it appears that, while the 
prediction is relevant, it requires a pre- 
sentation time considerably shorter 
than Taub used to test it adequately. 
A different type of experiment was 
reported by Lawrence and LaBerge 

(1956) in which they were concerned 
with the response hypothesis of a fading 
memory. The stimuli were pairs of 
cards from the Wisconsin Card Sorting 
Task, with an attention-instruction set 
given only before the exposures, by in- 
dicating the relative value for each 
attribute correctly reported. For one 
group of Ss, all three attributes (num- 
ber, color, shape) received equal value; 
for another, one attribute was always 
emphasized by offering 100 times as 
much money as each of the other two 
attributes. All three attributes had 
to be reported, but S determined the 
order. A third group, with equal set, 
had the order in which the three at- 
tributes were to be reported controlled 
and varied from trial to trial. 

_When the emphasis and equal con- 
ditions were compared, they found that 
the effect of set appeared to increase 
accuracy for the attended-to attribute 
and decrease it for the other two. They 
also found that the decrement from 

TSt to third report under the equal 
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conditioni was about the same magni- 
hers w the difference between the 
m Sa and incidental attributes in the 
Sok hasis condition, While they did 
ee measure the order of report when 
nequal attention instructions had been 
es Lawrence and LaBerge (1956) 
ee that the set effect could have 
i en entirely determined by the order 
Ti Which S reported the attributes. 
bn is, if, whenever one attribute was 
"Ege: S always reported that one 
Bees then the accuracy of the critical 
attribute would be higher because it 


h: 
dad been reported first. 


Thus, ton 
^. Lawrence and LaBerge sug- 
gest that no assumption need i made 


about reorganization of memory or 
changes in response probabilities, but 
only that all memory for the stimulus 
is slowly fading. 
first will be more accur 
reported items. This hypo 
very similar to one Wilcocks 
tested but rejected. Lawrence an 
LaBerge cannot test this in their data 
since thev did not measure the pre- 
ferred order of report when the atten- 
tion instructions made one attribute 
critical. . ; 

This test was provided in experi- 
ments by Harris and Haber (1963) 
and Haber (19642). The same stimu- 
Jus materials and procedures for in- 
ducing sets were used, but each S was 
tested under all set conditions, includ- 
ing a forced order of report spect ed 
by E on half of the trials. and 
order on the other half. S could report 
the stimulus in whateve 
(though he had to indicate t 


when just the 
was analyzed, i 
a critical attribute, !* , 
curately reported than ! 
incidental one. The Sam wa cpi 
the second and third reports. us, 
Lawrence and LaBerge (1956) were 
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incorrect when they argued that the 
set effect could be explained in terms of 
order of report. Holding order of re- 
port constant, the set effect was still 
obtained. Harris and Haber (1963) 
did find that some Ss reported the 
critical attributes first if they were 
allowed to choose their own order of 
report. But even granting this, it did 
not account for the set effect. Thus, 
this simpler hypothesis of Lawrence 
and LaBerge was shown to be inade- 
quate. A 
While the findings of Harris and 
Haber (1963) and Haber (1964a) in- 
dicated that fading memory coupled 
AS o varying order of report could 
not explain Vo sS. ees LS dis- 
cover what may be 4 more ess 
determiner of set—namely, the encod- 
ing processes By wich S tropsyates We 
percept of the stimulus into memory. 
Some pilot work had indicated that 
nearly all Ss spontaneously encoded 
this type of stimulus into words and 
silently rehearsed the message repeat- 
edly until all of the attributes had been 
reported. Further, with these stimuli, 
it was found that Ss used one of two 
dissimilar strategies to accomplish this 
encoding. Some Ss encoded it by 
separating the stimulus into objects on 
the left and right (e.g, one red tri- 
angle, three blue stars). The remain- 
ing 55 encoded it by separating the 
its three dimensions (e.g. 


red blue, triangle star, one three). The 
order of the dimensions in this latter 
Dimensions code could of course be 
varied without disturbing the strategy, 
rmer Objects coding strat- 


es order of encoding fixed by 


To investigate i 
ential coding strategy on perception, 
d 


arris an Haber 
Hoa ined Ss to use one of the 
ies. In both of these 


experiments, an effect of set and of or- 
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der of report was found. (Emphasized 
dimensions were reported more a 
curately than unemphasized dimen- 
sions, holding order of report con- 
stant, and the dimensions reported first 
were more accurate than those reported 
later.) However, both of these effects 
strongly interacted with coding strat- 
egy. So strongly, in fact, that nearly 
all of these effects were accounted for 
by the performance of those Ss using 
the Dimensions code. Thus, those Ss 
who used the Objects code, based on 
English syntax, were not significantly 
more accurate on the emphasized, as 
compared to the unemphasized, dimen- 
sions. Further, regarding the order 
of report effect, as shown in the repli- 
cation study (Haber, 1964a), the dif- 
ference between the early and later re- 
ported dimensions was smaller for 
those Ss using the Objects code than 
for those using the Dimensions code. 
Finally, Ss using the Objects code had 
a greater overall accuracy of report, 
regardless of condition. 

It seems clear therefore that the set 
and order of report effect can in large 
part be explained by the operations of 
the coding strategies themselves. These 
results suggest that the set effect stems 
as much from a change in the initial or- 
ganization of the memory of the stimu- 
lus produced by the strategy by which 
S encodes the stimulus, rather than 
from a facilitation of contact with an 
already organized memory or through 
the effects of a fading memory or 
through the changes in the probability 
of the correct response being available. 

However, the results on coding strat- 
egy do not indicate the reasons for the 
differences between these two coding 
strategies. To assess these, Haber 
(1964b) conducted a third study, using 
the same stimuli, strategy-training pro- 
cedures, and set manipulation, but re- 
quiring S to verbalize his encoding and 
rehearsal of the encoded stimulus so 


it could be tape-recorded. Each S was 
instructed to begin talking immediately 
after the flash with his verbal encoding 
of the stimulus, and to repeat this en- 
coding over and over for 20 seconds. 

Haber found that Ss using the Ob- 
jects code could begin encoding con- 
siderably faster and complete encoding 
more rapidly than Ss using the Dimen- 
sions code. Thus, the two strategies 
differed sharply in the speed with 
which they could translate the stimulus 
into memory. If there is some after- 
image or short-term memory of the 
stimulus, lasting perhaps several hun- 
dred milliseconds beyond its offset 
(Averbach & Coriell, 1961; Mack- 
worth, 1963; Sperling, 1960), then the 
strategy that can translate this image 
into memory most rapidly is more 
likely to have that encoding based on 
a good image that will allow it to be 
more accurate. In this way, one can 
account for the difference in overall 
accuracy between the strategies. Simi- 
larly, if one strategy encodes slower 
than the other, then presumably the 
items it encoded last would be poorer 
in accuracy, which would suggest that 
the order of report effect could also be 
accounted for by the differences in the 
speed of encoding. Finally, it was 
found that the Dimensions coders 
nearly always encoded the critical di- 
mension first, while the Objects coders 
always followed the same order—that 
of English syntax. Thus, the set vari- 
able can also be explained by the 
differential speed of encoding. Tf the 
attributes encoded first are always the 
critical ones, leaving the others to be 
encoded later from a rapidly fading 
image of the stimulus, it is not sur- 
prising that the critical attribute is 
reported more accurately. This does 
not depend upon order of report, as 
Lawrence and LaBerge thought, but 
on order of encoding. : 


The two strategies also differed in 


EFFECT OF 


the frequency of errors made after the 
initial encoding, with transpositions, 
intrusions, and omissions occurring 
significantly more frequently during 
the rehearsal process in the Dimensions 
code. This could further account for 
the order of report effect—the items 
reported later would be based upon 
a rehearsed encoding that had under- 
gone some erroneous change. This 
Would be true whether S was reporting 
the stimulus attributes in an order he 
chose or in one forced upon him by the 
experiment. 

This experiment suggests, then, that 
the coding strategies differ in two im- 
portant ways: the speed of encoding. 
and the resistance to errors during re- 
hearsal. It further suggests that these 
two differences could account for both 
the effect of set and the effect of the 
order of report in the experiments by 
Lawrence and LaBerge (1956). Har- 
ris and Haber (1963). and Haber 
(19642), as well as by implication some 
of the earlier experiments discussed 
above. For example, the study by 
Neisser (1954) can now be explaine 
in terms of encoding, without having to 
make any assumptions about the nature 
of the percept. This alternative €*- 
planation suggests that while the Te 
Sponse given to the homon 
know is identical, t 
those two words into memor 
be quite different. 
difference could 
faster for words one 
Thus, encoding of 
KNow begins later. 
therefore uses à 
degraded trace. While there ?*; E 
no direct test of this expl du 
Seems reasonable that the more pe 
Pared S is for what he will see. the 
more likely it is that he wi 
appropriate strategy n 3 
available in which to encode the stimtt- 
us, 


the 
takes longer. 
rogressively more 
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It should be noted that variation in 
encoding is not the only nonperceptual 
interpretation of the Neisser (1954) 
experiment and of experiments similar 
to it. Since Neisser's Ss are in a 
guessing situation, where they must 
guess the word from presumably only 
fragments of the letters given percep- 
tually (at least on the first few flashes), 
they will have more success guessing 
those words they were given a set to 
see (eg, NO will be easier to guess 
than its homonym KNOW, which they 
had not experienced in the preliminary 
training). This interpretation has 
been used by Eriksen and Browne 
(1956) in a related type of study. It 
says nothing about increased clarity of 
the stimulus caused by the prior set, 
but only that such a set may facilitate 
e correct word when only 
f it are seen perceptually. 

The encoding analysis advanced 
above is à more specific hypothesis 
about the outcome of the Neisser ex- 
periment than the more general guess- 
ing one. It attempts to specify the 
translation of the visual image, as it is 
generated on the retina by the stimulus, 
jnto some kind of memorial trace or 
persistence after the stimulus termi- 
While the visual image may 
a brief time after the stimu- 
in the form of a short-term 
it is crucial that perceptual 
e into account the processes 


guessing of th 
a few pieces O 


memory, 
theory tak 


that occur 
ory has faded. Since it is obvious that 
memory for stimuli persists long after 
the stimulus (and any attendant short- 
memory and afterimages) has 
no perceptual theory can 
gnizance of this 


term 
terminated, 
be complete without co 


ersistance of memory. 
Specifically, the encoding explana- 


tion being discussed here suggests that 
most visual stimuli are remembered by 
being encoded into previously learned 
linguistic units. usually words. This 
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encoding takes place while the args 
(or its brief short-term pep É- 
still present, and whatever 5 
been encoded after the stimu = s 
faded is entirely lost and not avai A 
in permanent memory. This is apos 
deny that a few Ss may use some r : 
linguistic code, especially for nonlin- 
guistic stimuli. However, certainly 
with respect to the kinds of materials 
of greatest interest to psychologists, 
encoding into words is the most prob- 
able basic strategy. 

Variation in the adequacy of encod- 
ing can presumably depend upon a 
number of variables. The experiments 
reported above on encoding of Wiscon- 
sin Card Sorting Task concept cards 
suggest that when a number of highly 
familiar dimensions of a stimulus must 
be encoded quickly, accuracy of encod- 
ing depends upon whether S has at his 
disposal a strategy that lends itself to 
rapid coding, preferably without having 
both to learn the code as well as per- 
form the encoding operations. Thus, 
using a highly overlearned syntax for 
the order of encoding is preferable to 
one for which S must think about what 
should be encoded next. 

In many perceptual experiments, the 
problem is not one of speed of encod- 
ing familiar dimensions, but of en- 
coding degraded and often unpredict- 
able stimuli. For example, a typical 
trial of a word-recognition experiment 
presents S with a fragmentary flash 
which first has to be maintained in 
memory and then that memory com- 
pared with possible alternatives of what 
the original stimulus must have been. 
If S has no adequate way of maintain- 
ing the fragments in memory, he will 
not be able to test any hypotheses 
about the stimuli once it terminates. 

Thus, in most perceptual situations, 
two sources of variation are present — 
the adequacy of encoding a visual im- 
age, often imperfect, into memory, and 
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the success of guessing what stimulus 
gave rise to those remembered frag- 
ments. To take an extreme though not 
too uncommon instance, stimuli for 
which S has no code for translation 
into memory are notoriously difficult to 
remember, even though S may per- 
form perfectly on simultaneous discrim- 
inations with such stimuli. That is, 
he can make all of the appropriate 
responses to the stimulus while it is 
present, but cannot reconstruct it after 
it terminates. Most people experience 
this difficulty in describing colors, 
tastes, odors, and feelings from recol- 
lection. It is usually suggested that 
they do not have the appropriate words 
or other codes with which to maintain 
them in memory. These examples 
represent instances of excellent match- 
ing between the stimulus and prior 
experience, but very poor encoding of 
the stimulus into memory. Thus, 
short-duration presentations or delayed 
responses would show poor perform- 
ance. One exception to this is found 
for those few people with eidetic im- 
agery (see Haber & Haber, 1964) who 
are capable of maintaining a nearly 
perfect visual image of the stimulus 
for many minutes. These eidetic per- 
ceivers seem to do little if any encoding 
of the stimulus while the image persists, 
so that once it fades their memory of 
the stimulus is little better than that of 
more typical individuals. 

Evidence relating to this specific 
distinction between encoding the per- 
cept and then matching the resultant 
memory to prior knowledge is very 
sparse, though it is beginning to ac- 
cumulate, For example, McKinney 
(in press), by presenting single letters 
continuously and asking S to report 
changes or fragmentation in the percept 
of the letter, found that if the letter was 
embedded in a series of other letters 
which S was labeling as letters, much 
less change was reported than if it was 
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embedded in a series of geometric 
designs for which no labels were sug- 
gested. — Thus, the stability of the 
percept is increased when the object 
can be labeled (an operation probably 
equivalent to encoding). ý 
" aa (1965), following work 
y Conrad (1964), presented lists of 8 
symbols, selected so that matched pairs 
of them would be auditorily confusable 
(eg, 2 with Q, 3 with T, etc.) 
Analyses of errors showed that audi- 
tory confusions were far greater than 
chance, suggesting that Ss were main- 
taining these symbols in memory by 
some type of auditory encoding and 
rehearsal. This was explicitly found 
When, for the half of the trials on which 
White noise was used to prevent S from 
hearing his own thoughts, S produced 
overt rehearsal. However, Hintzman 
also reports that some Ss did not use 
rote aural encoding, but rather at- 
tempted to classify the string of sym 
bols (e.g., to make up @ mathematical 


formula). Such Ss made few aural 
confusions. Thus, the nature of the 
coding adopted by 5 determined the 

" This, of 


pattern of errors made. 
re the sym- 


course, was a situation whe 
entation was 


bols from which each pres 
in advance. 


drawn were well known 
Hence the matching of ? 
memory to prior experience was pre 
sumably perfect, with all of the vart- 
ance in the encoding of y 
Gruber, Kulkin, and 
(1965) presented a paired 
learning task in which half o 
were instructed to attempt o 
visual image between the t 
of each pair. These Ss 
recall scores for all expos" 
ranging from 1-24 secon s. 
again, it would seem that aids for €n- 
Coding of perceptually presented fa- 
miliar material increase the accuracy © 
its encoded representation in 
The investigation and ™ 


-associate 
f the Ss 
form a 
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of coding variables in general in the 
study of perceptual and cognitive proc- 
esses is becoming more extensive and 
is playing a far greater role in theory 
than 5-10 years ago. Much of the 
impetus for this has been Miller's 
(1956) paper on coding effects in in- 
formation processing, although scat- 
tered work was reported before (e.g. 
Fitts, 1954; Pollack, 1952). More 
recently, however, à number of im- 
portant experimental and theoretical 
papers have applied encoding processes 
to a wide range of perceptual data and 
problems (e.g. Broadbent, 1958; Con- 
rad, 1964; Glanzer & Clark, 1962, 
1963; Mackworth, 1963; Miller, Gal- 


anter, & Pribrim, 1960; Sperling, 


1963). 
The three studies on coding strate- 


gies by Haber have shown the impor- 
tance of coding processes in the investi- 
gation of the effects of set. They 
suggest the possibility that many of the 
effects of set on reports of perception 
can be explained by differential en- 
coding processes. Many studies have 
reported that the emphasized attribute 
of a stimulus can be reported more 


accurately than other attributes. irre- 
spective of when the instruction. for 
However, in the 


emphasis is given. 
Haber studies, this finding is dependent 
the nature of the encoding 
strategy—only one type of strategy will 
produce this emphasis-set effect. Simi- 
type of strategy leads 

Thus, 
(and 


upon 


restriction o 
a set effect may depend upon whether 


's encoding strategy is $ 


such à manipulation. —, — . 
An alternative possibility 15 that on 


the trials when the set is given to S, 
he changes his strategy $0 as to maxi- 
mize the payoff from the set. In the 

only a few Ss 


Haber experiments, 
varied their strategies from condition 
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ition—too few to provide any 
doge pent of their performance. 
But this is a very likely possibility, par- 
ticularly if the various set conditions 
make quite different demands upon Ss 
reports. Since the studies by Haber 
are the only ones that have shown this 
higher-order dependency of set on en- 
coding, further research will be needed 
before these possibilities can be ana- 
lyzed successfully. 


CONCLUSIONS 


Evidence on the effects of set has 
been reviewed with respect to two 
basically different hypotheses—set en- 
hances the percept of the stimulus while 
S is actually viewing it; set facilitates 
report of the stimulus without affecting 
its percept. 

At least three varieties of nonper- 
ceptual hypotheses were discussed: 
(a) response-limiting or response- 
probability changes, (b) order of re- 
port changes coupled with a fading 
memory, and (c) reorganization of the 
memory process itself. The second 
alternative does not seem to be a neces- 
sary condition for the production of set 
effects, since even when order of report 
is controlled the effects of set are still 
found without loss in magnitude. The 
supporting evidence for the response- 
probability interpretation is extensive, 
in that the magnitude of the set effect 
varies with the manipulation of the 
probabilities of responses or limitations 
on responses. The third alternative, 
reorganization of the memory process 
mediated by S's coding Strategy, is 
strongly supported in the results of 
three studies by Harris and Haber 
(1963) and Haber (19642, 19645). 
Further, Haber's interpretations of 
those results suggest that the response- 
probability explanation may be reduced 
to memory reorganization (encoding), 
so that only one nonperceptual hy- 
pothesis may be needed. 
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While this review then suggests a 
narrowing of the number of nonper- 
ceptual alternatives, it provides less 
resolution to which of the two basic 
hypotheses is correct. The problem 
of analysis is still very complex—many 
of the experiments discussed provided 
clear-cut evidence for response or 
memory explanations, but without 
simultaneously demonstrating a lack of 
a perceptual effect. That is, finding 
evidence in favor of one hypothesis 
does not disprove the other, since they 
are not incompatible, only alternative. 
There is no reason to doubt that both 
may be correct and occurring together. 

It is likely that a set effect created 
by differential value (eg., Harris & 
Haber, 1963; Kiilpe, 1904; Lawrence 
& LaBerge, 1956) is mediated pri- 
marily or perhaps totally by character- 
istics of S's coding strategy. How- 
ever, even with this, the Harris and 
Haber (1963) and Haber (1964a, 
1964b) experiments do not rule out 
perceptual changes, since they had no 
converging operations sufficient to sep- 
arate percept from coding effects. 
Their argument took the form that 
the set effect was present only in SS 
using one type of strategy and not in 
Ss using another—therefore the set 
effect must have been due to properties 
of the strategy. Further, when the 
Strategies themselves were examined, 
temporal and interference processes 
were found that were sufficient to ac- 
count for the differential set effects 
between the two strategies, though 
again, this does not rule out perceptual 
effects completely. . 

So what conclusions does this evi- 
dence permit? Most Ss feel an in- 
crease in perceptual clarity occurs 
under appropriate set conditions, 
though their reports give no guarantee 
that they can really distinguish felt 
clarity from a better memory. On the 
other hand, there is no reason to doubt 
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their reports either. Just because the 
evidence supporting response processes 
is clear does not imply that perceptual 
enhancement does not occur. Further, 
Bruner and Postman’s (1949) finding 
of compromise perceptions and the 
strength of the trapezoidal illusion, to 
pick just two examples, make it difficult 
to doubt that at least some kinds of 
sets affect perception directly. 

_ Therefore, this review must conclude 
inconclusively with respect to à choice 
between the two hypotheses. Some 
evidence exists to support each of 
them, and some exists which favors one 
over the other. But there is none that 
Supports one while disproving the 


other. The issue is still crucial in 
perceptual theory, and obviously much 
eded. 


more very careful research is ne 
There is adequate demonstration now 
of the effects of set on both perception 
and responses. It still remains to show 
the conditions under which each will 


occur, 
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THEORETICAL NOTES 


HABITUATION AND STIMULUS NOVELTY: 
A MODEL BASED ON CLASSICAL CONDITIONING 


LARRY STEIN! 
Wyeth Laboratories, Inc., Philadelphia 


A theory of habituation is described which assumes that: (a) The 
arousal reaction is controlled by antagonistic excitatory and inhibitory 
brain mechanisms; (b) a novel stimulus activates the excitatory 


mechanism which in turn activates the inhibitory 


mechanism; and 


(c) the inhibitory process is conditionable. The inhibitory mechanism 
becomes conditioned (by Pavlovian conditioning) to the onset of the 


stimulus itself after repeated presentations. 


Habituation of arousal 


is said to occur when the anticipatory, conditioned activation of the 
inhibitory mechanism overrides the direct activation of the excitatory 
mechanism. Reactivation of the arousal reaction when the habituated 
stimulus is changed is explained by a reduction in conditioned inhibitory 
activity due to stimulus-generalization decrement. 


When a signal is presented repetitively 
to an organism in a well-controlled en- 
vironment, response to the signal habitu- 
ates. On initial presentation, the signal 
elicits an investigatory reaction and other 
signs of behavioral alerting. Subse- 
quent presentations of the stimulus lead 
to progressive attenuation of the arousal 
pattern until it becomes very weak or dis- 
appears altogether (Sharpless & Jasper, 
1956). 

Habituation of the arousal reaction is 
not due to simple fatiguing of sensory or 
effector elements. The process clearly 
does not depend on effector fatigue since 
a response that has become habituated 
to a familiar stimulus may still be elicited 
by a novel stimulus of equal intensity. 
It is also clear that habituation does not 
depend on sensory fatigue or nerve ac- 
commodation since (a) it develops even 
when long periods of rest intervene be- 
tween presentations of the stimulus, and 
(b) the orienting response may be re- 
stored, after habituation is developed to 
an intense stimulus, by suddenly reducing 
the intensity of the stimulus (Sokolov, 
1960). Since a mild stimulus could 


1The helpful comments of M. E. Bitter- 
man, P. L. Carlton, and R. F. Thompson are 
gratefully acknowledged. 
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hardly activate a sensory receptor fatigued 
by strong stimulation, Sokolov’s experi- 
ment plainly shows that the arousal reac- 
tion depends on stimulus change or 
novelty, and that habituation develops as 
stimuli become familiar. It follows. 
therefore, that the brain must have some 
means of distinguishing between novel 
and familiar stimuli in these situations. 

Tt is commonly supposed that the detec- 
tion of stimulus novelty requires that 
some representation of prior stimulation 
be coded and stored in the brain so that 
incoming signals can be matched against 
the stored signals for similarity. If an in- 
coming signal matches one of the stored 
signals, the arousal response is sup- 
pressed; if no match occurs, the arousal 
response is evoked (Sokolov, 1960). Al- 
though it is conceivable that the brain can 
in fact store all prior input, scan the store 
with each incoming signal, and perform 
the required matching operation, still it 
seems that such a hypothesis assumes as 
much as it explains, Other assumptions 
will be required as well. For example, an 
additional mechanism would be needed to 
classify and index the store of coded sig- 
nals. Otherwise, the time required to 
scan the store would increase as the 
store is enlarged by experience, and one 
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1s led to the improbable conclusion that 
detection of stimulus novelty should take 
longer as organisms grow older. 
x ird event, it is the aim of the pres- 
t paper to suggest an alternative mech- 
anism for the detection of stimulus novelty 
in habituation experiments that does not 
Fegnire the brain to perform complex 
scanning, matching, and indexing opera- 
tions, The present hypothesis, based en- 
tirely on classical conditioning principles, 
derives chiefly from the suggestive com- 
ments of Pavlov (1927, P- 256) on the 
disappearance of the investigatory reflex, 
but also contains elements from other 
theories (Sokolov, 1960; Thorpe, 1956) 
Following Sokolov (1960), it is as- 
sumed that two antagonistic neural sys- 
tems are sequentially activated by the 
initial presentation of a stimulus. Activa- 
tion of the first system excites OT facili- 
tates the arousal reaction, whereas activa- 
tion of the second system inhibits the 
arousal reaction, One may assume that the 
excitatory system is activated by the onset 
of the novel stimulus and the inhibitory 
system by its prolongation (Sokolov, 
1960) ; or better, one may assume that the 
onset of the stimulus activates the ex- 
citatory system, which in turn engages 
the inhibitory system. This sort of 
negative-feedback arrangement resembles 
in formal characteristics, though not in 
time course, the recurrent inhibitory path- 
ways of Eccles (1964, PP: 209-215) and 
would serve to decrease the sensitivity © 
the arousal mechanism for 2 period © 
time after it is activate For this 
reason, perhaps, the arousal 
habituates more rapidly if 
of the signal are massed and tends to 
recover over time (spontaneous 
—Thompson & Spence" 1966). 
Case, the discussion 
the sake of simplicity» 
elapses between trials to 2 
ual inhibitory activity tO 
It is further assume 
of the inhibitory system 15 
in the Pavlovian sense, 
the signal. If so, o the next P 
tion, the signal will 
Systems simultaneously 
quentially—the excitatory 5 


as before * and the inhibitory system 
anticipatorily by conditioning. The ex- 
citatory system will in most cases be ace 
tivated more vigorously than the inhibi- 
tory „system, since only one trial of 
inhibitory conditioning will have preceded 
the presentation of the stimulus. The 
arousal reaction therefore will again be 
evoked, but its strength will be somewhat 
diminished. However, the second presen- 
tation of the stimulus will constitute 
another inhibitory conditioning trial. 
Hence, on the third presentation of the 
stimulus, the inhibitory system will be 
activated still more rapidly and vigor- 
ously, and the resulting arousal reaction 
will be correspondingly weaker. Further 
presentations of the signal will continue 
the process until the conditioned activity 
of the inhibitory system finally overrides 
the directly evoked activity of the excita- 
tory system. At this point, the arousal 
reaction will no longer occur at all and 
will be said to be habituated. A diagram 
of these processes is given in Figure 1 

What if a new stimulus is presented 
instead of the habituated one? The ex- 
citatory system will be directly activated 
as before, but the inhibitory system will 
not be activated anticipatorily since con- 
ditioning to the new stimulus has not 
taken place- The new stimulus will 
therefore evoke an arousal reaction. 

If the new stimulus is similar to the 
habituated stimulus, the inhibitory sys- 
tem may be partially activated as à result 
of stimulus generalization. In such à 
the probability or vigor of the 
reaction will be inversely related 
oree of similarity between the 


Apelbaum et al. (1960) al- 
The important 


oint to be note! is that any 
in the habituated stimulus (in- 
s intensity, as in 


cluding @ decrease in it! 
produce à 


Sokolov’s experiment) can 


2 Jt is simplest to assume that the excita- 
tory sys i ivated in the same strength 
each ti is presented, regardless 
of the degree of hi bituation of the arousal 
response. If decrement 1S assumed, the 

i Excitatory activ- 


theory becomes unwieldy. t ] 
ity W however, on the intensity and 


ill depend, I h 
other characteristics of the stimulus. 
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FEN a 


TRIAL I 


EEG or Behavior 


SIGNAL 


TRIAL N 


(habituated) 
EEG or Behavior 


TIME ————____. 


na Llo 


—— DIRECT ACTIVATION 


Fic. 1. Diagram of conditionin 
vates excitatory brain mechanism, E, 
Arousal response is elicited while ex 
as inhibitory activity grows; further: 
of the arousal response will be ele; 
activation of inhibitory mechanism, 
of inhibitory mechanism to 
nism; hence, arousal responsi 
conditioning also takes pla 
below-zero habituation.) 


stimulus-generalization decrement in con- 
ditioned inhibitory activity and therefore 
permit the arousal reaction to reappear. 
According to this theory, then, stimuli 
in habituation experiments are familiar 
to the extent that they have been condi- 
tioned to a mechanism that inhibits the 
arousal reaction. Completely novel stim- 
uli have no conditioned tendency to ac- 
tivate the inhibitory system, whereas 
stimuli that are very familiar can activate 
the inhibitory system so rapidly and in 
such strength that no arousal reaction 


Ses eoi a imn iem * CONDITIONED ACTIVATION 


g model of habituation, 
which in turn directly 
citatory activity pred 
more, because inhibitio 
ated for a short time, 
conditioning occurs. 

onset of signal overrides dir 
€ fails to occur and is said t 
ce on this trial despite absi 


(TRIAL 1: Signal directly acti- 
activates inhibitory mechanism, I. 
ominates and declines in strength 
n outlasts excitation, the threshold 

Since signal onset just precedes 
TRIAL N: Conditioned activation 
ect activation of excitatory mecha- 
o be habituated. Further inhibitor y 
ence of arousal response, producing 


takes place at all. One should not as- 
sume, however, that stimulus familiarity 
depends exclusively on inhibitory condi- 
tioning of the sort described above. For 
example, if a signal is paired with a 
primary reinforcer, the signal may ac- 
quire familiarity by being conditioned to 
a neural reinforcing system, The general 
rule, perhaps, is that stimuli seem familiar 
to the extent that they can evoke condi- 
tioned reactions. 

Thompson and Spencer (1966) in a 
recent review of the habituation literature 
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me nate he clet habituation phe- 
18-20) > ımmary statement (pp. 
mn provides a convenient test of the 
ne power of the theory. Some 
Pe (e.g, spontaneous recovery, 
s us generalization) have already 
en discussed; others are treated below. 
f One key finding, "ubiquitous as habitu- 
son itself" (p. 19), is that dishabituation 
;ccurs when an additional (usually 
Strong) stimulus accompanies the habitu- 
i stimulus. According to the present 
aap such dishabituation would happen 
of me reasons. First, direct activation 
c he excitatory mechanism would in- 
Tease because of the additional input 
ion by the dishabituating stimulus. 
"ee oris conditioned activation. of 
d i inhibitory mechanism would decrease 
it is correct to assume that a stimulus 
complex consisting of the original stimu- 
lus plus the additional stimulus differs 
enough from the original stimulus alone 
to produce a stimulus-generalization 
decrement. Upon repeated application of 
the additional stimulus, of course, habitu- 
ation to the stimulus complex would 
develop in the usual manner. 
Another finding that has r 
planation is the observation 
tional habituation trials give 
arousal response has disappeared will re- 
sult in further response decrement (as 
indicated by slower recovery OF some 
such measure). Such “below-zero” 
habituation is puzzling because it sug- 
gests that elicitation of the arousal re- 
sponse is not essential for the develop- 
ment, or continued development, © 
habituation. A strong feature of the 
Present theory is that it is able to handle 
this phenomenon. The critical assump- 
tion is that the excitatory mechanism, 
and not the arousal response serves as 
the “unconditioned stimulus" for the In- 
hibitory process thought to underlie habit- 
he arousal 


uation. And although 
in strength and 


esisted ex- 
that addi- 
n after the 


response diminishes 
eventually  habituates the excitatory 
mechanism does not (see Footnote 2) 
s unconditioned 


quate on 
conditioning on 


f the degree of 


This ensures an ade 
stimulus for inhibitory 
every trial regardless ° 
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W 


habituation. Since inhibitory condition- 
ing can proceed even after the arousal 
response disappears, continued presenta- 
tion of signals can produce below-zero 
habituation. 

Finally, what may be said about pos- 
sible anatomical and biochemical sub- 
strates of these hypothetical arousal-regu- 
lating mechanisms? There is now 
general agreement that the reticular 
formation of the brain stem is the focus 
of an excitatory mechanism for arousal, 
and that this mechanism is under adrener- 
gic control (Magoun, 1963b). The 
focus of the inhibitory mechanism is much 
more uncertain. It could conveniently 
be located in the reticular formation itself 
(Moruzzi, 1964), but other sites are 
possible as well (Clemente & Sterman, 
1963; Magoun, 1963a). Like the inhibi- 
tory Renshaw cell arrangement discussed 
by Eccles (1964), the inhibitory mech- 
anism could be under cholinergic control 

Hernández-Peón & Chavez Ibarra, 
1963). If so, then it should be possible 
to retard or prevent habituation by block- 


ing this mechanism by administration of 
anticholinergic drugs. Such a result has 
been reported by Carlton. (1964, 1965). 
who found, in a variety of well-controlled 
tests, that the interfering effect of novel 
stimulation on behavior does not habituate 
if familiarization training is carried out 
under the anticholinergic drug, scopol- 


amine. 
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STATISTICAL LEARNING MODELS AND 
INDIVIDUAL DIFFERENCES 


R. A. WEITZMAN 


Minnesota Na 


In most applications of statistical le 
at individuals do not differ with respect to the 


his paper examines the evidence, develops a 


with but little evidence, th 
models’ parameters. T! 


statistical test for the assumption, 


data reported in the liter 


differences which affect data may not 
ions of the data. 


which provide accurate descripti 


Ne eee learning theorists have gen- 
Ssperts Bened that all the subjects of an 
fitta vent for which a model is appro- 
moder! nave equal values of each of the 
bes EI parameters; | Although this has 
ani a tacit assumption, it 15 nonetheless 
Sod A ie one. If generally true, it 

obviously have significant 1m- 


Plications for the psychology of indi- 
If true but only 


M differences. 

r i i i 

hy special circumstances, it coul 
ace serious restrictions on the applica- 


ae of statistical learning theory. 
5 TRU to Bush and Mosteller (1955, 
re ), statistical learning theorists | are 
ange podus that all subjects 
"s e approximately the same parameter 
real de in order to fit their models to 
ven ata. Despite its importance. how- 
atter psychologists have devoted ped 
Ga to this assumption. Although 
tion ey and Jonckheere (1956) ques- 
cal m the wisdom of building 
an niig theory on such an 
Cro ee base, Atkinson, Bower, an 
tha thers (1965, p. 105) seemed to believe 
tok statistical learning models are likely 
Hon. robust with respect to the assump- 
irec Not all assumptions, of cours are 
(109 testable; and, until now, 
to h ) seems to be the only 
Bue 7 attempted a direct 
evide This paper examines 
the ence, develops a sta 
noae s PUN applies 
ature s and data reported 
and i, and considers possible 
implications of the results. 


i 
explanations 


3 


tional Laborat ry and the University of Minnesota 


arning models, it is tacitly assumed, 


and applies the test to models and 


‘ature; the mixed results suggest that individual 


affect the parameters of all models 


Bower's EVIDENCE 


1962) concluded as a result of 
what he purported to be a statistical test 
that for his subjects, at least, there were 
no individual differences with respect to 
the single parameter ¢ of his model; he 
based his conclusion on the following 
premises (see pp- 50-51): 

1. The sample variance of total errors 
made by à subject will tend, under H»: 
c; = cj for an arbitrary subject j and 
every subject i Æ j, to be equal to, and, 
under Hi: €: 4 6 for an arbitrary subject 
j and at least one subject ij, to be 
larger than, the theoretical variance for 
a subject computed by using the formula 
of the model. 

2. Computed values of “F.” the ratio 
of the first to the second of these vari- 
ances, were not large enough to be 
significant. (The quotation marks in- 
dicate that, according to Bower, this 
ratio is only approximately distributed 


as F.) 


Although Bower stru 
so that support O 


Bower ( 


ctured his argu- 
f his conclusion 
rather than 


is not an unreasonable one. 
either premise is false, the 
e established. 

f fact. neither premise 
so cannot be true or 
t to the major premise. 
gle-parameter models 


If, however, 
conclusion re 

As a matter © 
makes sense an 
false. With respec 
it is true that for sin: 
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TABLE 1 
NOTATION 


—— 


Symbol Formula 


Statistic 


Dropabiliey of error by an | =m Depends on model 


on trial m g 
probability of error by an 
S on trials m &M 

e 98 Z 
h oretical proportion of 
Ss erring on trial m 


spends del 
saw | Depends on mo 


T! icc tion of 
Theoretical proporti f Lae. 

Ss erring on trials m & S mew 
"Total no. errors for an SIT = 
Sample Mean of T 


a 
»^ T; 
Theoretical variance of T | Var(T) | 3E Gear $45) 
rall 551. m i. 
Theoretical variance of T | Vart) |= z (Emst — Tz 
for an $ LE 


Average theoretical vari- 


ance of T for an S n 2 Var(f)i 


ample estimate (biased) | Sr? ee 

Sa t Var(T) iig -I 

Sample estimate (un- sri n(n — 1)7^57* 
biased) of Var(T) 

Model estimate (biased) | Sé Depends on model 


of Var(t) 
Model estimate (un- 
biased) of Var(t) 
Model estimate (un- 


biased) of Var(t) 


Depends on model 


nD sei? 


like Bower's the sample variance is under 
Hi an overestimate of the average theo- 
retical variance for a subject (see Table 
1) This fact is implied by Theorem 1 
of the Appendix. It cannot be true, 
however, „that under M, the sample 
variance is either an over- or an under- 
estimate of or even equal to the theo- 
retical variance for a subject, because 
under 77, there is no single value of the 
theoretical variance for a subject, sub- 
jects with different parameter values 
having, according to the model, different 
values of this variance, As for the minor 
premise, there is no way to compute the 
appropriate number of degrees of freedom 
for the denominator of "p^" Bower 
acknowledging no problem here evi- 
dently assumed this number to be equal 
to infinity. Such an assumption is 
certainly unwarranted, however; even 
under Ho, the value of the theoretical 
variance is not known, but can only be 
estimated from the same data as the 
sample variance. A nonsignificant value 
of “F,” therefore, contributes nothing to 
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the argument which, stripped of its 


constituent premises, is no argument 
at all. 


A Test OF INDIVIDUAL DIFFERENCES 


Although Bower's evidence does not 
withstand scrutiny, the basic idea of 
using different estimates of the variance 
of total errors to test for individual 
differences remains a good one. Theo- 
rems 1 and 2 of the Appendix form the 
basis of a test of individual differences 
which involves the use of such estimates. 
In order to avoid confusion in both the 
development and application of these 
theorems, it is necessary to use a rather 
elaborate notation. This notation ap- 
pears in Table 1. The test of individual 
differences can be applied to all statistical 
learning models for which it is possible 
to compute values of s?. The following 
example illustrates the derivation of the 
formula for this statistic in the context of 
a specific model. 


sé for Theios’ Model 


Theios (1963) presented a model 
similar to Bower's; Theios’ model in- 
volves a parameter, c, which is to be 
interpreted in the same way as the 
parameter c of Bower's model. Theios: 
assuming (like Bower) a common value 
of c for all his subjects, estimated this 
value by N/T, where N designates the 
number of transitions between successive 
learning states which (according to the 
model) a subject must undergo before 
achieving complete learning. Substitut- 
ing the corresponding estimate for ? 
single subject, N/T, into the expressio" 
which Theios provided for Var(f) yields: 


S? = T(T — N)/N. 


If the expected value of S? is equal t? 
Some constant H times Var(t), then 
së = S2/H; only when H —1 is S’ 
equal to S?. To find the value of H, 
it is necessary to determine the expecte 
value of S?. If this expected value 19 
denoted by E(S?), then, in the context 


mA 
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ar mar : R 
Theios’ model, d designating the differential operator, 


E(S2) = 
T-N 


adi dm " 
ü s(5- j)ea eor 


ay qq e iam (y - Dea Lye 


T=N 


= Q/NDTC Dnr-2- 


T=N 


(T= Ny — ory 


= aswel — od (E. TU 7 cy-/àe* 


= a/ND)e0- od (X TU — c)r7)/dc* 
T= 


LU 


a/N HeX (lL — ad (1/c)/de* 


= a/N)eva — AN + 1) ye 


= (ND -29/6 


nis furi equation from the end being 
: e because the N^ derivatives with 
pst to c of 1, 20 — 0,30 — oe 
NU — o)? are equal to zero. 

EC according to Theios 
=Ni-od/e, H=(N+ 1)/N and 


= TT = N)/(N + 1). 


iio ae preceding example serves not only 
for bue the derivation ofa formula 
di „sê, but also to help clarify the 

stinctive meanings of s? an 


Table 1). 


Test Statistic Zo 


Under Ho, according to Theorems 1 
and 2, sp? and sê, being unbiased esti- 
x respectively of Var(T) and Var), 
Should be equal. This being so, 


the 
statistic 
Zo = Dvu/sp 
where, with 


D; = s — n(n— pds ts 


D- a, Di = ser sr’ 
7 
and 
2 z TR 
ahs (gp ec) 3 GE DY, 
J 


(say, 7 > 30) ap- 
distributed with 
e. For models 


is 

peta ed Ho for large ” 
proximately normally 
zero mean and unit varian 


——— 


involving à single parameter, according 
to Theorem 1, 20 provides à lower-tailed 
test of individual differences, 2 significant 
value of zo indicating that different sub- 
different values of this pa- 
For models involving two or 
more parameters, according to Theorem 
2, Zo rovides & two-tailed test of in- 
dividual differences, 4 significant value 
:ng not only that different 

subjects have different values of one or 
parameters, but also, if 20 
is larger than zero, that subjects typically 
e side of average early and 
on the other side of average later in 


learning (see Theorem 2). 


APPLICATIONS 


Application of the test just developed 
to the models and data reported by 
Theios (1963) leads to some interesting 
results. Theios considered two models, 
one (Model 1) involving one parameter 
and the other (Model 2) involving twc 
Aside from their different 


numbers of parameters, however, the two 
i i The formula for 


sê derived in the preceding section but 
one applies with N = 2 to Model 1 and 
with N considered a parameter, its value 
to be determine: by data, to Model 2. 
Theios applied Model 1 to the data of 
rats run by himself in a study of trau- 

dance learning; considering 


matic avoi : 
the fit to be good, he applied the same 
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model to the data of dogs run earlier by 
lomon and Wynn (1953) in pe lar 
T - Considering the fit of Model 
on. he data of Solomon and Wynn to = 
vede ^n wever, Theios applied Model 2 
ais S uta and, determining N to be 
P M pd 4, obtained now what he con- 
sider d (by current standards) to be a 
= a fit. Following are the results of 
B Ning the test of individual differences 
So redet 1 and the data of Theios (1963, 
p. 416) and to Model 2 and the data of 
Solomon and Wynn (Bush & Mosteller, 
1955, p. 239). 


Model 1 and Theios' Data 


Testing Theios’ data for individual 
differences with respect to the parameter 
of Model 1 yielded a value of zo equal 
to .749. Although this value implies 
that there are no individual differences, 
it is of course possible that there are 
individual differences which are so small 
that the test has insufficient power to 
detect them. To investigate this possi- 
bility, 20 Monte Carlo experiments were 
run, each involving 100 subjects sampled 
from a population of subjects whose 
parameter values were normally dis- 
tributed with standard deviation equal 
to .001 and mean equal to .427 (.427 
being the number which Theios obtained 
as an estimate of the parameter value of 
each of his subjects). Values of Zo were 
computed for all the Monte Carlo ex- 


periments, each value being smaller than 
zero. Then, a value of 


m = (749 — zj)/A1 + 1/20, 


where zo denotes the average value of Zo 
for the 20 Monte Carlo experiments, was 
computed with the result, z, = 1.771. 
The chances that Theios sample was 
drawn from a population of subjects 
having parameter values which are 
normally distributed with a mean of .427 
and a standard deviation of .001 or 
larger are thus less than 5 in 100. This 
result implies that if there are individual 
differences among Theios' rats the dis- 
tribution of these differences, if normal, 
has a standard deviation smaller than 
001. From the standpoint of the 
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psychology of individual differences, this 
is a provocative result. It means that 
it is possible to build models which, while 
ascribing no portion of individual varia- 
tion in behavior to differential genetic 
endowment or preexperimental experi- 
ence, nonetheless are able to describe 
behavior extremely well (see Theios 
Table 2). 


Model 2 and the Data of Solomon and 
Wynn 


Testing the data of the dogs for in- 
dividual differences with respect to the 
parameters of Model 2 presents the 
problem of estimating two parameter 
values from the data of a single subject. 
Theios, assuming that all his subjects 
had the same value of each parameter, 
solved simultaneous equations involving 
T and S7? to obtain the two parameter 
estimates. Since for a single subject T 
is equal to T but Sr? is only trivally 
equal to zero, a substitute for S7? must 
be found. Aside from the formulas for 
the probability distribution and for the 
mean and variance of total errors, the 
only formula which Theios presented for 
Model 2 is the formula for the expected 
value of the trial number of the last 
error. This number, E(L), can be 
estimated from the data of a single 
subject (fortunately) by the trial number 
L of the last error. According to Theios: 


E(L) = (N/c) x (Mb 


Substitution in this formula of L m. 
E(L) and (since for Model 1, with 7 


known, c is estimated by N/ T) N/T for 
¢ results in the equation 


L/T = È (N — k41 
k=l 


Using this equation, N was estimated 5 
follows. Starting with the value 1 
Successive integral values of N were 


tried, the estimate of N— N—being 
taken as equal to that integral value for 
which 


» With the corresponding estimate 
o smaller than zero or larger than 1, 
difference between the two sides of 


ofcn 
the 


^ 
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the equation we 

edd x |^ NN Then, c was 
"S ore parameters estimated 
b T the value of zo was 
m i the result being zo = 4.046. 
of en Ly only implies the existence 
veg ual differences, but also, be- 
oen is po than zero, carries the 
Mem x urther implication that dogs 
ide ade a greater than average number 
mA m during early trials typically 
ene V sad than average number of 
That uring later trials and vice versa. 
bs Bei dogs actually behaved in this 
ll vA support from two sources. 
Wynn oS de of Solomon and 
o ti. a , p. 13) that the correlation 
equ e number of responses preced- 
number jes conreat jam me ip s 
e apes of alternations between incorrect 
ecol pest responses is =42, The 
ey eerte of .903 between the 
Sm e „estimates of N and c In 
ey E l'heios model, subjects who 
epa e (rom state to state (have 
P d f ues of c) must traverse many 
"eme opi large values of E 
Benne ion of learning, and vice versa. 
is ge speculating on how widespread 
pe phenomenon may be, however, 
"es wee to consider the results in the 

: ‘ction. 


ne 1-X" and the Solomon and Wynn 
ata. 
eios, there is at 


lea: s 
st one important difference between 


^e experiment and 
toe ad and Wynn 
sh th the rats and the dogs Were run in 

uttle boxes, the dogs proceeded on 


al i n . 
Pe sche trials in different directions, 
ereas the rats proceeded on every 

Analyzing 


trial j s E 
al in the same direction- 


t! á 
he Solomon and Wynn experiment as 
dd trials of one 
other Theios 


experiment, Model 1 applying to eac of 


$ ^ A 

aa ee experiments, 

Por el for the Solomon an 

the this model (Model 1- 

mios s states—0, 1, 
ing to the two tran 


Sh sitio 
, 
ere are five states, On or e 
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possible values oficj(j-20,12 

subject in state ij bein "d hs 
Model 1 in state ion odd ecce edes 
j on even trials. According to Theios 
the probabilities of responding correctl : 
in successive states are, for Model M 
‘00, .50, and 1.00 and, for Model 2 (with 


N=4), 00, .25, .50, .75, and 1.00 
respectively ; since for Model 1-X the 
corresponding probabilities are 

(.00 + .00)/2 = .00, 

(.00 + .50)/2 = (.50 + .00)/2 = 25. 


(.00 + 1.00(/2 = (50 + .50)/2 
= (1.00 + .00)/2 = .50, 
C50 + 1.00)/2 = (1.00 + 50)/2 = .75, 


and 


(1.00 + 1.00)/2 = 1.00, 
nd Model 2 (with N = 4) 


should have identical fits to data. 
Testing the Solomon and Wynn data 
for individual differences with respect to 
the single parameter of Model 1-X 
resulted in a value of zo equal to .530, a 
value not at 2 i istent with the 
is of no individual differences. 
andpoint of the psy- 
differences, is 
ive than the 


Model 1-X a 


but for a circum- 

‘ated out in the next 
be attributed to the homo- 
of breeding and preexperimental 
teristic of laboratory 
d Wynn subjects, 
ver, were mongrel dogs. 


howe 
tent that Models 1 and 1-X 


To the ex 
fit the data well and properly, as indeed 


at least Model 1 seems to do, both the 
Theios rats and the Solomon and Wynn 
dogs demonstrated a rather curious 
phenomenon: hey did not by any 
noticeable amount, positive or negative, 
habit of running in one 
abit of running in the 


rats. 


direction to the h 


half, the earlier learning 
thus constituting 4 highly relevant and 
experi- 


differentiating preexperimental 
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i t of Model 1 to all these 
ae ede em the slightest tendency 
for individual parameter values to vary. 
As for the dogs, Model 1-X explicitly 
excludes the — of transfer be- 

o habits. 
oet of this and the preceding 
two sections run counter to expectation; 
it is instructive to attempt an explana- 
tion of these results. Although Models 
1-X and 2 were fitted to the same data, 
individual differences affected the fit of 
Model 2, but not of Model 1-X. Had 
individual differences failed to show up 
in the fit of Model 2, the explanation 
might have been simply that traumatic 
avoidance learning is too rapid or that 
data of ones and zeros are too coarse to 
reveal the existence of individual differ- 
ences. Because individual differences 
showed up in the fit of Model 2, however, 


not. In 
individual 
stribution, 
t responses 


A Note on COMPARING THE Fits 
or MopELs 


Model 1-X with one parameter fits the 
same data as well as Model 2 with two 
parameters. This does not necessarily 
mean that Model 1-X is better than 
Model 2. The comparison is not fair; 
Model 2 was fitted without taking into 
account the effect of individual differ- 
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ences on its parameters. Whereas Model 
1-X was fitted properly, Model 2 was 
not. A proper fit of Model 2 would have 
to take into account the effect of in- 
dividual differences. That Model 2 
without accounting for this effect fits the 
data as well as it does, moreover, would 
seem to be a matter of chance. Had 
Theios used E(L) instead of Var(t) for 
parameter estimation, he would have 
obtained estimates of N and c equal to 2 
and .256, respectively, and the corre- 
spondingly poor fit of the dashed line 
of his Figure 6. 

Generally, fairness in comparing the 
fits of different models to the same data 
requires testing the fits for individual 
differences; for a test result which in- 
dicates that individual differences affect 
a model's parameters implies that the 
fit tested would be better if it took this 
effect into account than if it failed to do 
so. The following example illustrates 
that the improvement in fit might be 
substantial. 

Consider two Model 2 estimates of 
Var(T), one which takes into account 
the effect of individual differences and 
one which does not. Consider the second 
estimate first. If Theios had used E(L) 
instead of Var(t) for parameter estima- 
tion, he would have obtained 22.62 as 
such an estimate. The corresponding 
standard deviation is 4.76. 

Now, consider the first estimate. A 


formula for Var(T) alternative to that 
of Table 1 is 


E m 
Var(T) = 55 gà, — CE EX)? 
[mr] [mr] 
where K = s and, with Aj designating 
the probability that a subject will make 


a total of T = k errors before completion 
of learning, 


X. =n] px YT 
je 
Theios (1963, p. 410) presented a formula 
for the 


Probability that a subject will 
make a total of 


A T =k errors before 
completion of learning. Substitution 
into this formula of the individual esti- 
mates of N and c which were used in the 
test of individual differences for Model 2 
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and the data of Solomon and Wynn 
resulted in corresponding individual esti- 
mates, Xj, of the X; substitution of the 
Än (by way of the formula for X) 
together with a value of K = 50 into the 
preceding formula for Var(T) led to 4.01 
bee Model 2 estimate of Var(T) which 
akes into account the effect of individual 
differences. The corresponding standard 
deviation is 2.00. 
' Since Sp? = 6.36 (Sp = 2.52), a Model 
t estimate of Var(T) which takes into 
account the effect of individual differ- 
TN fits the data substantially better 
ces a Model 2 estimate of Var(T) which 
does not take into account the effect of 
individual differences. 
_ Theios, using current standards, 
poen his fit of Model 2 to be good; the 
M that, accounting for the effect of 
eei differences, the fit might be 
E ter suggests that testing for individual 
i an ey is not necessarily counter- 
2 Visa by a good fit, as, for example, 
inson et al. (1965, p. 105) seemed 
to believe. 
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APPENDIX 


THEOREM 1 


k Consider a model consisting of only a single 
patometer c which represents rate of learning. 
amid cj designate the value of c for subject j 
SH E the value of c for subject å 6 = h^ 

rd = ty f + hea apd = h2 n); 
pa define Ho: c; = c; for an arbitrary subject 
pers every subject ¢ and His G * G for an 
f itrary subject j and at least one subject i. 

Var(T) denotes the overall theoretical 
variance and Var(t) the average theoretical 
ariance of total errors for à subject, then, 
Under JZ», 
Var(j) = Var(T) 
and, under Il, 
Var(t) € Var( T). 

Proof. Under either Ho oF Hy, let Tm 
*signate the probability of selecting at "m 
be a subject who will make an error on md 
sel and let 7,3; designate the probability © 
ere at random a subject dic r e 
‘N error 2 i. M. so, und 

on both trials » and ^ oe ability 


° or Hi, | designate the pr 
th 1, let mm designa 
at a selected subject will make an error on 


nate the probability 

]] make an error on 
Then, under Ho, 
and, under Hı, 


trial m, and let *m desig: 

that a selected subject wi 

both trials m and M. 

fim = 74 and sa = TmMi 

fim = (n3) È Tmj 
i 


and 
Sax 007) PEN 
i 


By definition, 
E (rnui — meirati) 

a 

ent of terms and ap- 


may be written, 


Var(t) = (n7!) z 2 


which, on rearrangem 


propriate substitution, 

Var() = È Zima — DD (wt) E rmjs 
mM m M i 

that 7mj = Fim + Smj- Then, 


Define 54; SO 
hat under either Ho or Hi 


it is seen that un 
(i) Z ôni = 0 

i 
and that, since wj is larger or smaller than 
average as ĉj is smaller or larger than average 
for every trial m, under Ho Cov(m, dar) = 0 


and under Hy, Cov (rs à) > 0 for every pair 


of trials m and M. 
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Substituting appropriately in . E UT 
expression for Var(t) and simplifying yield: 


TET S D Ga — Fito) ] 
Carlt) = CE E Gi» 


m M MB 
— X ECovtó,, oy). 
m M 
ession in brackets is, by 
since the expression 1n X 
^ M equal to Var( T) and since under He 
ae + 


€ E Covina, ôu) = 0 
mM 


and under Mi 


X X Cov(ôm, 5) > 0, 
- M 


then, under Zo, 


Var(t) = Var(T) 
and, under Hi, 


Var(t) < Var( T). 


‘THEOREM 2 


Consider a model consisting of two or more 
parameters c, d, «-- which represent learning 
rate and possibly initial or terminal perform- 
ance. Let cj, dj, +--+ designate the values of 
€, d, +++ for subject j and ci, di, «++ the values 
of c, d, +++ forsubjecti(i = 1,2, +--+, (j -— 1), 
(GAA) enj = 1,2, +6 +, n), respectively ; 
and define Ho: c; = cj and d; = d; and --+ for 
àn arbitrary subject j and every subject i and 
Hi: c Æ cordi s djor +++ for an arbitrary 
subject j and at least one subject i If 
Var( T) denotes the overall theoretical variance 
and Var(t) the average theoretical variance 
of total errors for a subject, then, under 77a, 


Var(t) = Var( T) 


and, under M, there being two cases 
when perform 


» (a) 


ance is consistently above or 
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below average throughout learning, 


Var(t) < Var(T) 


. : f 
or, (b) when performance is on one side e 
average early and on the other side of averag 
later in learning, 
Var(t) 2 Var( T) 4- W, 
where 
i| > o. 


Proof. Proceed as in the proof of Theorem 
1, establishing the equation 


Var(t) = [X z (Emst — Vau) 


m 3 


— SX X Cov(5,, ôn). 


mM 

Since the expression in brackets is, b 
definition, equal to Var(T) and since 
under Ho, 


D X Cov(dm, dar) = 0; 
m M 


under H, (case a), 


E X Cov(dm, dsr) > 0; 


mM 


and under M, (case b), Cov(óm, dsr) being 
greater or less than zero for different pairs ^ 
values of m and M, 
| Y E Cov(ôm, ôx)| > 0; 
mM 
then, under Zo, 


Var(t) = Var( 7); 
under J, (case a), 

Varti) < Var( T); 
and, under //, (case b), 


Var(t) = Var(T) + W, 
where 


IW] 2 0. 


^i 
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OPTICAL TEXTURE VERSUS RETINAL PERSPECTIVE: 


A REPLY 


ROBERT B. 


Pennsylvania 


Differences in numerical analysis 
ing from perspective t 
Flock's criticisms © 
shown to be in error. Converse 
cients as a measure o! 
for description of the 


the effects of texture gradients on 
ns of simple assu 


ted for better, at least in a qualitative 


of perspective theory. By mea 
Flock and others can be accoun! 
sense, in terms of perspective 


di This reply to Flock's (1965) note isa 
M speeds of Flock's numerical analysis 
cd e results of investigations of visual 
slant, of his unfounded criticisms of my 
enn of retinal perspective, of the 
eri tae between his theory of t 
ges and my theory of perception, and 
ti alternative variables for the specifica- 
on of texture perspective. 


NUMERICAL ANALYSIS 
cited in Free- 
(1965), have 
ineffectiveness 
cue to slant, 
s of linear 


NU empirical studies, 
x (1965) and Flock 
ct onstrated the relative 
ml ANM gradients as à 

en compared with the effect 


ence in visual angle size of thé nearest 
» of surface 


nable in most cases. 
COrrect, since typical Tex 
(eg. small circles scattere 

SOB eNeOUS surface) SU tend small 
x. angles, and therefore are hie 
ern ee only minima’ size perspec 
lo: even for extreme stants. However 
ock’s analysis here is irrelevant. 


tical texture-gradient 


"s This note was written in connec’ 

Search supported by ran F 10 

Pu Institutes of Health, Uni 
lic Health Service. 


heory and textur 
í the mathematica 


f perceived slant is criticized as 
data. Finally, 


theory than in terms 


TO FLOCK* 


FREEMAN, JR. 


State University 


and experimental predictions result- 
e-gradient theory are discussed. 
1 specification of perspective are 
ly, Flock's use of regression coeffi- 

being inadequate 
results of experiments showing 
visual slant are analyzed in terms 
imptions, the data of 
of optical theta. 


ggest that relative angu- 


lar size of texture elements might be a 
to slant, Flock (1965. p. 


508) suggests that it does: 


... not only did I discuss the question of 
surface texture and visual acuity, but I also 
predicted .. . that as a term like Ox-r 
approaches liminal values, surface texture 


will be increasingly ineffectual in eliciting 


good slant judgments (Flock, 19642, p. 386). 
What 


Flock has misquoted himself. 
Flock (1964, P- 386) had suggested was 


that visual angle ratios of optical 9" 
(which is his measure of texture perspec- 
tive) may be very close to unity when 


the field of view is very restricted. 


On this point, I heartily agree with 
Flock. As will be shown below, linear 
i as defined in Equation 


ing horizontal and 


1964) does not su: 


ing contours. 
ive i ive cue for slant, 
e much more effective when 
over wide extents of the 
The effect of variation of 
f the field of view on the 
ived slant ofa textured surface has 
two implications for the theory of "opti- 
». (a) Texture perspective 
variation in field 


invariant with 
Tlock (1964, pp. 


as predicted by 


of view, 3 
388-389) ; (b) since perceived slant 
varies with feld of view, optical theta 

ately the slant of a 
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distal surface from _ optical variables 
[ Flock, 1964, B: = borne out by the 
This md ant aci reported by 
study by Flo . 511-512) in his note. 
Flock (1965; Dp. which is discussed 
That ExDe shows that the effectiveness 
further below, ture varies widely with 
of a sandai m f view. This finding 
iation in field of v $ Te 
yne the theoretical value of op ica 
ng d shows the expected nonveridi- 
ma e slant judgments dependent upon 
Cc 
es. 
Pa elite states that the data in F ree- 
man's (1966b) investigation of visual 
slant using textureless. rectangles are 
"artifacts of his experimental method 
(Flock, 1965, p. 509) because of the 
matching technique used. He says that 
"Ss' tasks as they accepted them may 
have been to match trapezoids for simi- 
larity, rather than slanted rectangles for 
slant equality [pp. 509-510].” He then 
goes on to point out that, even under 
circumstances where linear perspective 
was presumably subliminal in my experi- 
ments, the regressions of matched slant 
on the physical slant of the various 
standard rectangles were generally close 
to L0. There 


are two comments which 
I should like to make concerning Flock's 


Statements. In the first place, if linear 
perspective had been irrelevant as a cue 
to slant, Subjects would not have been 
affected to the extent that they were by 
perspective in their slant-matching task, 
Their responses would have been more 
nearly accurate, regardless of rectangle 
size, had they matched visual angle axis 
ratios rather than attempting to match 
Perspective, In fact, as Flock Points out, 
when perspective was subliminal, slant 
matches were quite unaffected by physical 
(and angular) size, indicating that per- 
spective is a dominant cue when it 


is dis- 
criminable, Second, the use ofa regres- 
sion coefficient here is inappropriate and 
without descriptive value, Because of 


the very wide Tange of stimulus slants 
used in my experiment, experimenta] 
variance of slant matches attributable to 
the effects of perspective will be small 
relative to the variance of slant Settings of 
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the standard. Consequently, the e i 
tude of the obtained regression — i 
depended as much on the range of P 
settings used as it did on the CEDERE = 
conditions under which they were og E 
This objection applies to the use o ye 
gression coefficients in the description b 
all of the data referenced by Flock - 
Table 1 (Flock, 1965, pp. 506-507) Lr 
in the text. The individual plotting 4 
response measures (data points) ue 
more desirable method for the presen : 
tion of data and will doubtless continu 


a al 
in use by investigators of perceptua 
psychophysics. 


RETINAL PERSPECTIVE AS A 
Cuz For SLANT 


Retinal perspective may be specified in 
various different ways, but generally ve. 
fers to the convergence of the rim 
separation of the projected locus of poin 3 
of equal physical separation with Ne 
ing distance from the eye. Pepen 
may then be measured by obtaining t 
difference in visual angle 
specified ne 
When conto 


separation o 
ar and far pairs of aya 
ur perspective is involved, Fe 
difference angle in the projection of bw 
near and far edge may be taken as t e 
measure of perspective, This measure 
varies not only with the slant of the ae 
ulus surface, but also with its Hp h 
width, and distance. For a stimu 2 
rectangle slanted about a horizontal aX 2 
perpendicular to the visual axis, perspec 


tive may be approximated by the rela- 
tionship, 


2 ad sin 8 


= DELL St — 
ô = tan C+ d —gqsmó' 


[1J 


Where a is the linear distance from e 
upper or lower edge of the rectangle ce 
the axis of rotation, d the linear distan! 5 
from the left or right edge to the VERMES 
center line, c the distance of the center 
the rectangle to the observer's eye, an 


e 
the (positive or negative) slant of ci 
stimulus, The equation yields a value 
Which, for 


iewe 
rectangles which are e 

normally to the center point of the ax! ad 

rotation, is equal to one-half the vi$ 


a, 
SS LO 
SS SSS, 
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angle di 
Sai ee between the near and far 

In his Fo amten reptania 
uus iiie t, Fock (1865p. 51 
Mdh bases 1i this equation is “false.” 
tion that a his statement on his assump- 
decreases e the observation distance (c) 
should d zero, tan 8, and therefore 8, 
analysis is e approach zero.  Flocks 
contrary , of course, incorrect. On the 
8, and en £ sSsuies VERY large values, 
Proaches ies g retinal perspective, ap- 
Separations of a In fact, the angular 
With incen all pairs of points decrease 
"lock's ded dex observation distance. 
Thy equatior two statements, concerning 
Measure ( ns for a possible alternative 
tradictor T) of perspective, are selí-con- 
them, ind Es s can be shown by quoting 
is aa teire require no comment : 
changed p man’s] statement that z is u 
13 self-evicl the height of the rectangle 
7 at the ments ss. Ihe angular size O 
the hei He very definitely changes with 

Final of the rectangle.” 
me with d Flock (1965, p. 511) charges 
Variable t reluctance to accept any other 
than the contour convergence 9 


Slant 

"ted p 

ceived amd as a stimulus for per 

Consist, ant..." His statement is a 
ith 


Neo with my position that W 
nd visual cular observation, visual shape 

inear slant are a function primarii} 
logs car Outline perspective [Freeman 
tour he 503; italics added].” When con- 
i hath 2 and texture perspective 
Sante ! Visible in a monocularly-viewe 
Well h Stimulus, texture gradients may 
Slang, Ave a minor effect on perceive 
Primari] owever, the effect should be 
naly,.* "egative, if the geometrica 
Slant, gi Equation 1 is related to visual 
ques a Ince a and d assume maxima 
Ps respo the edges of a slanted stimulus, 
Specti L^ 5€ of the eye to the retinal pet 

arising from those € 


hy 
St al 
8 s S i 
Decet, 9 De maximal. Conversely. m 
points 


Ctiy 
ic Ctive 
Cte ure a vergena of pairs O 

2b elements”) within the bound- 
fy; "quati, ^ 
S pce a other formulas for Dis 

ua 1 ormu of sian 

Deng: >, ari angle projection a 

Aix) € derived in Freeman 1966c, AP 
Or reasons of economy» they are 
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aries of the stimulus figure must take 
smaller values. Thus perceived slant 
could (although not necessarily must) be 
decreased as a result of visible texture 

adients, but cannot be increased! This 
will be true even if internal (element) 
and contour perspective are averaged by 
the eye, since the perspective difference 
angles (8s) of a substantial proportion 
element points must generate 
f minimal slant. Conversely, 
tive takes such a low 
value that it is not discriminable by the 
eye (as happens at small slants and in 
very small stimuli at any slant), then 
texture gradients must also be indiscrimi- 
nable, since their visual angle variation is 
necessarily less than that of the bounding 


contours. 


of pairs of 
sensations 0 
if contour perspec’ 


Tyrory OF STIMULI vERSUS THEORY 


d optical theta as being E theory 

:muli rather than à theory of percep- 
of oc . 514) does not feel 
x is gained by the distinction. 
deal is gained 


; be se 
OF si al parameters 
fy the proximal stimu 
or face. The proxim 


Jus to the re- 
al stimulus is 
some trans- 


oducing source. + 
pe optical theta is such 
tion. Since “the slant angle, 

Lae 9 i . . . optical 


e y phy 
slant get ON feta) must Be 
se, then optical 

effective PSY- 

d the transforma- 

plicit in 
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i y is a theory of the 
re-gradient theory is o 
vine stimulus which is meaningless for 
the analysis of vision. 


ALTERNATIVE VARIABLES OF 
TEXTURE PERSPECTIVE 


i unpublished study by 
A p denen briefly by 
Flock an 511-512), indicates that 
Flock (1965, pp. 511- ? fi weak 
ndom-texture gradients, althoug! weak, 
€ function as a cue to slant. Flock 
“1965 p. 511) asks how linear perspec- 
Nes ca predict the findings of this ex- 
periment whose slant stimuli were ran- 
dom-texture fields and lacked linear con- 
tours. This section will show that: (a) 
The theory of optical theta cannot ac- 
count for even the data of Flock and 
Graves; and (b) four postulates derived 
from linear-perspective theory are capable 
of predicting their results in a qualita- 
tive sense. 

The "elements" of the textured field 
used by Flock and Graves were large tri- 
angles of random size and shape, averag- 
ing 4.5? in visual angle extent. Viewed 
through apertures of 20°, 40°, and 80? 
of solid angle, at each of six unspecified 
slants, the regressions of the unspecified 
slant judgments on physical slants were 
10, .30, and .69 respectively. None of 
the three regression coefficients ap- 
proached the value of 1.0 predicted by the 
optical theta transformation, and all of 
them are exceedingly low in view of the 
very large visual fields used. Finally, 
the regression coefficients increased with 


increasing viewing field—a finding not 
predicted by optical theta, 
Granting the lack of Psychological 


validity of optical theta, how is it pos- 
sible for the eye to respond to texture- 
density gradients at all?3 Stated in 


8 In his Footnote 6, Flock (1965, p. 512) 
explains how linear outline perspective can 
be considered an "extreme and special case" 
of optical theta. However, the visual re- 
Sponse to linear perspective does not conform 
to optical theta, but rather to retinal perspec- 
tive as defined in Equation 1, Visually 
matched slant (Freeman, 1966b) increases 
and slant threshold decreases (Freeman, 
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other terms, what properties of the visual 
texture vary sufficiently with variation in 
slant to be detectable by the eye? I 
should like to suggest four visual cues 
which may, under some stimulus condi- 
tions, such as those used by Flock, con- 
tribute to a perception of slant. None 
of these requires any unusual assumptions 
about the discriminative capacities of the 
visual system. 

1. For relatively large texture elements, 
large visual fields, and sizable slants, pe 
projective difference in visual angle size 
of near and far texture elements may well 
contribute to the perception of slant. 
This is possible under two circumstances: 
(a) when texture elements are all of the 
same size, in which case the visual task 
is similar to that of responding to linear 
perspective, (b) when the relative size 
of irregular texture elements is known or 
perceivable. In any experiment which 
uses repeated presentations of a textured 
surface at a variety of slants, it is ot 
course possible for the observer to obtain 
information concerning texture element . 
size at the near and far boundaries of the 
visual display. 

The size perspective of equally (oF 
similarly) sized texture elements can be 
obtained with an equation of the form of 
Equation 1 when the distance (a) of the 
element from the axis of rotation, the 
diameter of the element (24d), and obser- 
vation distance (c) are known, and the 
entire surface is always visible. In the 
Flock and Graves experiment, however, 
the limits of the area of the surface which 
was visible through the 20°, 40°, or 80 
field stop were dependent upon the slant of 
the textured Surface. Under such con- 
ditions, the visual angle difference (28) 
between the elements at the near and far 
extremes of the visual aperture is give? 
exactly by means of the law of sines: 


2 cd sina tan 8 2 
ô = tan ! ba 2] 
€ + d'(Y — sinta sec? 0)’ [ 
pM NM" 


19663) with the increasing linear dimensions 


of the outline rectangle as predicted by per- 
spective theory, 
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where c is the distance of the observer's 
eye from the axis of rotation, 2d the 
diameter of the texture elements, 0 the 
slant of the surface from the fronto- 
parallel plane, 2a the visual angle limit 
of the visual field, and 28 the total 
visual angle difference between the near 
and far texture elements. Equation 2 is 
appropriate only when both the near and 
far elements are centered in the visual 
field. 

One can gain an appreci 
enormous size perspective pos 
the conditions reported by 
Graves by means of Equation 2. These 
investigators report an average visual 
angle element sizg at 0° slant of 4° 30". 
Let us assume that this angle was pro- 
duced by a texture element whose diam- 
eter was 10.6 centimeters (d — 5.3 centi- 
meters) viewed at a distance (c) of 135 
centimeters. Let us assume that a sur- 
face is viewed at a slant (8) of only 40° 
through an aperture of 80° (a= 40°). 
Using Equation 2, we obtain a vane | 
8= 2° 25,5’, The total visual angle dif- 
ference is therefore 28 = 4°51". : 

The value of nearly 5 degrees of size 
perspective is clearly large enough to be 
a strong factor in the visual perception of 
the slant of textured surfaces. Such a 
very large value for 28 is the result of 
the fact that, when viewed at 4 substan- 
tially large angle through @ large aper- 
ture, the visible linear extent of a tex- 
tured surface in depth is also very — 
Letting a be the linear distance iror Of 
axis of rotation to the farther limits © 
the visual field, and a’ the linear distance 
from the axis of rotation to the nearer 


ation of the 
sible under 
Flock and 


alue of 


_ limits, then, by the law of sines. 
bou DH 
a = ctna cos) — sin 0 
jas m — [3b] 
a = etna cos 0 + sin 
Combining and simplifying, 
(4) 


ata= 1 


2 cos! Ü + cos 2a — 
as defined above. 


e, 40", 


where all values are 


With the values «= 


- 135 centimeters, the obtained (a+ a") 
is 586.51 centimeters, This value is more 
than 4 times the distance from the eye 
of the observer to the axis of rotation 
of the textured surface. 

2. A second possible cue for slant under 
such conditions may be monocular accom- 
modation and image clarity (or lack of 
it). Although these cues have been 
found to be very weak or even negative 
(Heinemann, Tulving, & Nachmias, 
1959), they may play a minor role in 
space perception when large distance 
differences are involved. 

3. When the same textured surface is 
presented repeatedly to the same subject 
at different slants, the total number of 
visible elements may well be a condi- 
tional cue to slant perception when sur- 
faces are viewed through apertures of 
fixed visual angle as in the Flock and 
Graves experiment. Assuming equal 
average separation of texture elements at 
all points of the visible surface, the num- 
ber of visible elements must therefore 
increase as à linear function of increase 
of visible area of the surface with in- 
creasing slant. It can only be a condi- 
tional one, however, since the number of 
visible elements is a function of (a d: a’). 
which is equal at equal slants in either 
direction, multiplied by the density of 
the elements. If other cues for direction 
of slant are effective, then number of visi- 
ple elements can provide a secondary cue 
to amount of slant. 

4, Finally, there may also be some 
perspective cues generated by the change 
jn average angular separation of texture 
elements. In texture-gradient theory, the 
response to texture perspective requires 
that the eye be able to “count off equal 
numerical units of congruent luminous 
patterns along an optic meridian [Flock, 
1965, p. 389]." Such an assumed mech- 
anism of vision seems most improbable, 
especially in view of the demonstrated 
inability of the eye to estimate quantity 
beyond a critical value of about seven 
(Kaufman, Lord, Reese, & Volkmann, 
1949; Miller, 1956). A more parsimont- 
ous assumption would be that the visual 
system responds as if there were linear 
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contours whose retinal webs eoque 
then be evaluated. Presumably ! y 
could accomplish this only if it de ea 

i the texture elements 
gus ot rA nre onld be: posible 
neatly: the variability of texture-ele- 
only when To falls below some critical 
ment separations 

A the average texture-element sepa- 

ratio to The texture-variability assump- 
Dacha ould then be the only one neces- 
ien | explain the capability of the eye 
an register texture perspective. Con- 
versely, when the variability of the tex- 
ture elements exceeds that critical value, 
the eye should be incapable of selecting 
near and far element pairs of approxi- 
mately equal linear separation whose 
visual angle differences are to be 
evaluated. 

All four of these potential cues for 
visual slant vary with the solid angle size 
of a fixed viewing aperture. Variables 
1, 2, and 4 (Variable 3 is not applicable) 
are affected by the linear dimensions of 
slanted surfaces whose view is not ob- 
structed by a field Stop. Since all of 
these variables are so affected, if any 
combination of them is effective as a 
cue for visual slant, judged slant will in- 
crease (positively or negatively) with 
increasing viewing area of a slanted sur- 
face. Similarly, the regression coeffi- 
cients used by Flock will increase with 
increasing viewing area, These predic- 


tions are Supported by data of Gibson 
(1950) and Flock and Mo 


in their 
field of view 


viewing 
aperture, 

Finally, the prediction of Perspective 
theory that visual slant will increase with 
increasing viewing area must be com- 
pared with the prediction that the effec- 
tiveness of optical n will be restricted if 
the field of view is “too small.” Under 
such circumstances “the number of like 
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elements represented in the optical tex- 
ture may be too few [Flock, 1964, p. 
386].” This statement implies, however. 
a discontinuous scale of discriminability 
of slant under texture theory, from ab- 
sence of discriminability under unfavor- 
able conditions (small viewing aperture) 
to near-perfect discriminability with ade- 
quate apertures. It does not state that 
visual slant will vary with aperture size, 
as predicted by perspective theory, when 
aperture size has reached or surpassed the 
critical value. It is this prediction of 
perspective theory which is supported by 
the data of Gibson, Flock and Mosca- 
telli, and Flock and Graves. 


CowcLusiox 


Perspective theory can provide the 
basis for the prediction of outcomes of 
other experiments involving texture per- 
spective. Perceived slant of textured 
surfaces will be correlated with texture- 
element size, size of visual field, regular- 
ity of texture, as well as the physical slant 
of the surface. Under no circumstances, 
however, will the eye prove more sensi- 
tive to texture perspective than to linear 
retinal perspective, except when the tex- 
ture carries a regularity equal to that of 
outline perspective, as in a checkerboard 
Pattern. Even here, however, the linear 
perspective generated by the outline of the 


pattern must produce the greatest percep- 
tion of slant. 
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COMMENTS ON "A POWER COMPARISON 
OF THE F AND L TESTS—I" 


PAUL A. 


GAMES 


Ohio University 


30ersma, DeJonge, and Stellwagen 
tion except that where the specific 

rue. 
peers of using ordered-hypotl 
power comparison provides little use 
vs. nonparametric choice of tests in 
omnibus test situations. The conclus 
of the L test is somewhat dubious in 


strate equal risks of Type I error in the 


simulation of data that was inappro: 


blocks analysis of variance or the L test. 


In their article, “A Power Comparison 
of the F and L Tests—I," Boersma, De- 
Jonge, and Stellwagen (1964) seem to 
disparage the general usage of the omni- 
bus F test. They state that “. . . the most 
frequent practice is to perform the omni- 
bus test first and follow with a directional 
test only if the omnibus test leads to a re- 
jection of H,. We do not maintain that 
this is good practice, but it is almost uni- 
versal in application [p. 507]." Following 
this, they state: "This research gives em- 
pirical support to the prediction that the L 
test is more powerful for testing ordered 
hypotheses than the omnibus F test for the 
randomized blocks design [p. 512]." The 
authors avoid recommending that a po- 
tential user drop the F test and replace it 
with the L test in his next experiment. 
Lest the casual reader come to this con- 
clusion after reading the above article, 
the following points should be considered. 

In the F test, the null hypothesis, EF 
M,= Ma... My, is tested against all pos- 
sible alternatives, or not H,. If H, is 
retained, then no further steps are justi- 
fied, but if H, is rejected, then further 
procedures (such as all possible ; tests) 
are carried out to determine a likely al- 
ternative, such as M, = Mə < M< M4 
This procedure certainly leaves room for 
improvement in that the rejection of the 
null by the F test provides no information 
on which of the many alternatives should 
be accepted; any of the subsequent pro- 
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(1964) fail to consider any situ- 
ordered alternative of the L test 


They thus fail to consider the general statistical inference 


tesis tests vs. omnibus tests. The 
ful information on the parametric 
either the ordered-hypothesis or 
ion of consistently superior power 
that the authors failed to demon- 
2 tests and used computer 


priate for either the randomized- 


cedures are not well designed for this par- 
ticular purpose. Other omnibus alterna- 
tive procedures are being sought that may 
eventually replace the [f sequence. A 
two-sample sequential procedure is avail- 
able (Bechhofer, Dunnett, & Sobel, 1954) 
that enables the experimenter (E) to 
specify: (a) a region of relative indiffer- 
ence (eg., if |M, — M;| <s, the differ- 
ence is regarded as trivial), and (b) the 
probability (P) of arriving at a correct 
ranking of the population means if the 
adjacent means are more than s apart; and 
then yields a. solution lor the sample size 
required to accomplish this task, If ES 
are willing to allow statistical inference 
procedures rather than administrative 
considerations to determine sample size, 
it is possible to arrive at statistical con- 
clusions of greater clarity than: “The 
population means are not equal," even in 
the absence of any a priori hypotheses. " 

However, the L test requires that J? 


Specify a single alternative in which all 
means involved are a priori ranked. E 


then tests the model of equal population 
means against this single alternative. If 
this alternative happens to be true, noth- 
ing can be lost by this Procedure, and we 
would usually expect an increase in power 
with respect to this one alternative condi- 
tion. When one can specify population 
means on a computer, and even arrange 
them in a nice linear trend so the differ- 
ence between all means is constant, it is 


= or 
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rem d Pere the a a one-alterna- 
Al power than a test which 
: stribute the probability in the re- 
a rejection over a larger region to 
: many other alternatives. Unfor- 
ora most of the time E's a priori 
sd way ee population means 
computer siición "Us n aig zd 
eq epe lation. sually we are do- 
pus experiment and carrying out a 
n ical analysis precisely because we 
med UE background to doubt our à 
ae 'i predictions. Before adopting the 7. 
gestes such a practical situation, several 
A * p completely ignored by Boersma 
m at 64) are quite important. What 
^ s probabilities of falsely accepting 
see ae if another alternative is 
Dan 
(making a Ty TF sa et 7 inier 
conse ga Type I error)? W hat ae kun 
Piu quences and/or the relative impor- 
ance of the various kinds of error in such 
Situations ? 
„Kaiser (1960, p. 
El. particularly repugnant Qo + “ETTC 
Teie third kind’. . . involve deciding 
AE a difference in the wrong direction." 
M a was speaking of concluding that 
"i Mz when in reality M, > Ma (in a 
su e situation), however, the same kind 
rror and the same repugnance may be 
ier the k-mean situation. With 
"y test, it is certainly feasible that one 
Potl he many other alternatives to the hy- 
b esis of equal means may be true, but 
y sampling fluctuation the obtained sta- 
tistic leads us to reject the null and falsely 
Accept the alternative. If M,=Me € Ma 
=M, or if Mf, € Ms C Ma? Mo Ue 
probability of obtaining sample data such 
that X, < X < X, < X, and the L tett 
is significant may be high. Other "n 
lernatives more drastically different n 
interpretation might still have 27 a 
Preciable associated probability of obtain- 
ing a significant L statistic. Should the 
fidssessar of a significant statistic 
take his alternative hypothesis 25 proven 
(within the usual limits of statistical in- 
ference), or should he have some of the 
reservations of the usual F-t sequence? 
Another untouched statistical inference 
Point in the article is t o het 


163) has written: 
‘errors 


he process t 


2 


nya if the null is retained. In the usual 
-} sequence this is th 
e end of the game, 

but what about an E who has predicted 
A 1 7 : pies 
"durs s Ms < M, and obtained X, > 
ate A X, or some other interesting 
d iant from his prediction? Is he 
m y mE to run only the Z test against 
is pre iction and stop. or will publica- 
tion pressure or common sense cause him 
to resort to conventional methods to test 
this trend? Running a series of statistical 
tests on the same data is a sure way to i 
late the risk of a Ty disent 
ate the risk of a ype I error above the 
nominal value reported (Goldfried, 1959) 
. E must consider such factors in decid- 
ing whether to run an omnibus test or an 
ordered-hypothesis test. If he has de- 
cided to conduct an ordered-hypothesis 
test, then he has a further choice between 
the nonparametric L test and the para- 
metric F test for linear trend (Grant, 
1956; Winer, 1962). Although the para- 
metric test is sensitive to an additional 
violation of the null hypothesis that is ig- 
nored by the L test (namely Mi 2 
Ma? M4), these two tests ask the same 
general question of the data, and a power 
comparison is pertinent as à basis of 
choice between them. Similarly, if E has 
decided on an omnibus test, a power com- 
parison of the parametric F test and the 
nonparametric Friedman (1937) chi- 
square test would be valuable to assist in 
choosing between these two alternative 


tests. However, the comparison of the 
st and omnibus F test 


ower of the L te [ 
fails to clarify the appropriate parametric- 
choice in either situation. 


nonparametric 
atives used by Boersma et 


For the altern 
al. it is necessarily true that the F test 


for linear trend will have greater power 
e omnibus F test and that this 
superiority will increase with k. It thus 
seems quite likely that if the L test were 
contrasted with the comparable parametric 
he later would prove to be more 


test, t d c 
powerful. In addition, the following 
methodological considerations suggest that 

1a et al, conclusion of con- 


even the Boersm 
sistently superior 
over the omnibus 


dubious grounds. 
The major conc 


power of the L. test 
F test is based upon 


ern of statistical in- 
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m h trol 
DS r been the con 
ce has traditions y pde ps 
I Ka the relative merits of 
xs respect to the risk of a 
the tests W! Unfortunately, their study 
Type terror samples precisely at the 
failed to ems 3(M,—M,)?=.0 (the 
pant WE sl tins) but they did run 
null hypothesis 1s e) Under this 
M,— M,)? = .01. nder 
when X(À ^ri ires 3-6, their data show 
condition, in Figures reel P rejet- 
L test as having probabilities of rejec- 
the istently above the nominal .05 
eor pow nificance, while the F test stays 
ENT enn this level. In Figures 5 and 
canh probabilities of rejection of the L 
test approximate .15 and .10 at this point. 
Boersma et al. discuss this as evidence for 
the superiority of the L test for very small 
differences between the means. An al- 
ternative explanation that they do Not 
discuss, and do not eliminate, is that it is 
evidence for the inferiority of the L test 
in controlling the risk of Type I error, 
If the L test has an actual risk of a Type 
I error of approximately .10 when using 
the .05 level of significance, this would be 
a substantial liability yet would produce 
elevated power curves. Before attribut- 
ing such curves to power virtues, it would 
seem appropriate for the authors to dem- 
onstrate that the two tests do indeed have 
similar levels of risk of Type I errors. 
The second objection is that the com- 
puter approach employed was completely 
inappropriate for the two statistics used, 


Zither the randomized-blocks analysis of 
variance or the 


L test makes sense only 
in a situation in which we would expect 
the measures within each block to have 
"something in common," The model for 


either design is one in which each ob- 
servation is assumed to consist of the sum 


of the grand mean (M,), a treatment ef- 
fect (T;), a block effect (w;), and a ran- 
dom error (ej) which includes any block 
X treatment interaction, or Xy= M, + A 
+ w;+ eg (Winer, 1962). In the com- 
puter operations of the above article, Tis 
were added to successive random deviates 
to produce the various violations of the 
null; no block constants were added, how- 
ever. Treatment of the data collected as 
a kXn matrix mean that the k random 
deviates that happened to fall first in each 


feren 
of Type 
neglect to es 


sample were considered as having some- 
rae in common; the k random deviates 
that were second in each sample were con- 
sidered as having something in common, 
etc. The computer simulation of the data, 
however, was by procedures in which no 
“block effects” were present, so the data 
generated were of the form X; = d 
Ti+ eş. In this case, the appropria d 
analysis of variance is the simple pe 
domized independent-groups design, a 
an appropriate ordered-alternative distr! 
bution-free test for comparison would be 
the Jonckheere (1954) test rather than 
the L test. If a graduate student came 
with experimental data collected by pro- 
cedures analogous to those used in the 
above study and asked which test to run, 
the randomized blocks or the L test, the 
answer would clearly be neither. 

The randomized-blocks design has a 
major assumption not associated with the 
independent-groups design, namely, the 
requirement of equal correlations (across 
blocks) between all treatments. Boersma 
et al. have met this assumption, since all 
correlations obtained in their data are 
samples from a population with a cor- 
relation of zero; it does seem a quite dubi- 
ous practice however to encourage ES to 
generate random correlations between their 
observations from independent groups in 
order to apply either the randomized- 
blocks design or the L test, While such 
procedures should have no long-run effect 
on the risk of a Type I error, they do 
have an adverse effect on the risk of à 
Type II error. Running a randomized- 
blocks design on such a set of data results 
in a loss of power due to the fact that 
n—1 degrees of freedom are used to as- 
Sess the nonexistent block effect, This 
n — 1 degrees of freedom is thus lost from 
the error term and results in an increase 
in the size of the F statistic needed to re 
ject the null hypothesis, For example, in 
Boersma et al/'s Figure 1, the critical 
value of F for the .05 level would be 4.46 
(df — 288), while the critical value in an 
independent-groups design would be 3.89 
(df— 242). The power curve for the 
correct F test would be raised substan- 
tially in the middle portions due to 2 
Change of this magnitude. The magni- 
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tude of change in such comparisons de- 
creases as both b and » increase, but all 
such power curves would be somewhat 
higher than the corresponding F curves 
obtained by Boersma et al. 

The use of empirical sampling studies 
to investigate the characteristics of sta- 
tistical tests under conditions where 
mathematical solutions are not yet avail- 
able seems quite worthwhile, but only 
when the computer simulation generates 
the kind of data for which the test is ap- 
propriate. Although the present author 
is sympathetic with Boersma et al. in 
their desire to reduce the risk of a Type 
II error, this should not be done by ig- 
noring the risk of Type I errors, nor by 
ignoring the characteristics of the test 
when other alternatives than the single a 
priori prediction are true. Considering 
the usual limited theory back of most a 
priori predictions in psychology, the use 
of the L test seems like shooting at à 
sparrow on the wing with a rifle. If your 
aim is precise, a kill is assured, but there 
is an awful lot of space in which to miss. 
The shotgun approach of an omnibus test 
seems more suitable even in situations in 
which there is some basis for prediction. 
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IATION THEORY AND THE "SINGLE-STAGE" S-R MODEL: 
MEDI E 
^ DIFFERENT? 


LEON A. JAKOBOVITS ! 


Center for Comparative Psycholinguistics, University of Illinois 


i i i videly held notion of 
^ ument presented in this paper is that the widely l : 
2 ei tee difference between so-called single-stage S-R models 


and 2-stage mediation theories is illusory, and the distinction c 
in fact be maintained in S-R theories of language. 
of verbal behavior and certain phenomena that 
to the treatment given by Osgood, and the basic 


is pointed out. 


It is still fashionable today to teach 
students in various areas in psychology 
that there are two kinds of S-R models: 
the so-called single-stage theory vaguely 
identified with "tough-minded" Skinner- 
ians and the two-stage (or multiple- 
stage) mediation theory, equally vaguely 
identified with "tender-minded" cogni- 
tive theorists. The purported inade- 
quacy of the single-stage model to handle 
such ubiquitous phenomena as semantic 
generalization and symbol manipul 
is quickly pointed out, and the 
stage model is then discarded as 
relic of an earlier and more primitive era 
in psychology. It is the Purpose of this 
paper to show that the widely held dis- 
tinction between existing single-stage and 
mediation theories is illusory and that 
the explanatory power of the two“ 
of theories is actually identical. 


ation 
single- 
a useless 


types” 


THE Mepration PARADIGM 


There are many specific versions of the 
mediation paradigm. Perhaps its most 
general formulation has been recently 
restated by Goss (1961) as follows: 
When Events A, B, and C occ 
sequences 4 — B —^ C... and the proba- 
bility of the occurrence of C is actually or 
potentially greater or less when C is preceded 


ur in the 


! [ am grateful to C. E. Osgood and B. F. 
Skinner for their criticisms of an earlier draft 
of this paper. This acknowledgement should 
not be construed as an indication of either 
agreement or disagreement on their part with 
the arguments developed in this final version, 
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annot 
Skinner's account 
relate to it is compared 
similarity between them 


by 4 — B than when C is preceded by. A 
alone, B can be described as a mediating 
process or event [p. 286]. 


At this level of discourse, the mediation 
paradigm is neutral with respect to 
single- or two-stage S-R models. In- 
deed, as Goss shows, there was "re- 
markable commonality" in their con- 
ceptualization of the mediation paradigm 
between Associationists (Locke, Mill). 
Structuralists (Wundt, Titchener), and 
Early Functionalists (James, Dewey). 
One might add with confidence here 
Behaviorists, such as Watson, Lashley, 
and Hull, as well as contemporary 
theorists such as Woodworth, Skinner. 
Hebb, Osgood, and Mowrer. 

Disagreements arise as soon as one 
compares more detailed formulations of 
the mediation paradigm which attempt 
to specify the nature of the Events A, P 
and C: ideas, emotions, mental states 
reactions, neural processes, etc. 

Overt Versus Covert Events? 

Explanations of the 
tween single-stage 
models usually 
Presence or 
to covert, 


distinction be- 
and two-stage S-R 
come down to the 
absence of explicit reference 
Nonobservable events. For 
example, in one influential version of the 
mediation hypothesis (Osgood, 1953) the 
distinction js represented as follows :? 


? In S-R Paradigms such as those below, it 


is useful to distinguish between learned S-R 
habits (solid lines) from unlearned R-S oF 
T-S connections (dashed lines) where the 
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single stage 


S ————— R overt elements 
two stage 


S— rn,--^s,— R overt and covert 


The covert events tm--> Sm are said to 
mediate the overt events S—> R. Note, 
however, that the single-stage paradigm 
could easily be rewritten to include 
mediating events: 


S—R-S—R 


This is the paradigm for response chain- 
ing, and it fits the definition of the 
general mediation paradigm given above. 

In the light of this discussion, it would 
be incorrect to oppose the mediation 
hypothesis to single-stage theories. Since 
there is no theoretical restriction on the 
number of mediating events, it would 
also be incorrect to contrast single-stage 
versus two-stage or multiple-stage theo- 
ries. It is tempting then to settle for a 
distinction which would contrast the 
presence or absence of covert references 
in the theory. However, it is impossible 
to find among the known S-R formula- 
tions (of the so-called single-stage type) 
any that do not make an explicit refer- 
ence to covert responses. Furthermore. 
there is no logical necessity for mediation 
theories (of the so-called two-stage type) 
to insist on the presence of covert events. 
Theorists like Osgood have emphasized 
the covert nature of mediating events 


for two reasons. One relates to the 
difficulty of explaining certain phe- 
nomena, such as semantic generalization, 
where there are no obvious overt events 
that can account for the generalization 


obtained. The second reason relates to 
for the organ- 


the functional advantages s 
ism if mediating events can ea y 
Both Osgood (1953, PP- 


become covert. o 
397-398) and Skinner (1957, pP. 
437) have stated these points. 

Given the fact that all S-R formula- 
tions refer to covert events, the question 
' s i ufficient and 
occurrence of the response is à S! 
necessary condition for the occurrence of the 


self-stimulation. 


can be raised (Fodor, 1965) whether 
these covert events have identical prop- 
erties in the so-called single-stage and 
two-stage models. Thus, Fodor argues 
that the covert mediating events in the 
single-stage paradigm retain a one-to-one 
relationship to overt events and hence 
have no excess meaning and no ex- 
planatory power. Indeed it is said that 
the theoretical usefulness of covert 
mediating responses in the two-stage 
paradigm lies in their relative independ- 
ence from direct environmental stimula- 
tion (e.g. as in symbolic manipulation 
and thinking). However, close examina- 
tion of existing theories reveals that the 
concept of covert mediating response 
always entails relative independence 
from environmental conditions. Since 
this last statement needs justification, 


the theories of Watson and Skinner will 
men are 


be briefly examined. Both 
identified in the minds of many psy- 


chologists with an empty organism, 
blackbox approach. 


VERBAL MEDIATING RESPONSES 
AND SELF-STIMULATION 


Watson and Skinner probably owe 


their reputation for rigidity about theo- 
retical assumptions dealing with un- 
observables to unjustified claims to 
objectivity when talking about their 
theoretical systems. But it is not so 
much what they say about their theoriz- 
ing as what they do in their theorizing 
that concerns us most in this discussion. 
Goss (1961) has argued convincingly 
that Watson's formulations of language 
Jearning and symbolic processes were 
forerunners of today’s mediation models, 
and Watson's diagrams showing these 
relations in S-R terms "anticipate those 
which are now largely identified with 
Hull [p. 289]." Development of lan- 
guage habits, according to Watson, 
parallels the development of motor 
habits (manipulative, locomotive, and 
perhaps emotional, visceral responses). 
At first, motor habits alone are estab- 
lished. Later the individual finds him- 
self in situations where both motor and 
labeling responses are called for. In 
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VN. ization or elimina- 
the end, with pce pete the 
tion of other uus Fro Base eicit 
stimuli may come cal responses may be 
only? ‘These iad At this stage— 
either overt php point for the 
and this s mientecthe vocal responses 
present wes stimuli for (ie., mediate) 
may aci mela responses or other 
pescara This property, by 
vod these chained covert events (vocal 
— nonvocal) act as both stimuli and 
oc eponees, is referred to here as self- 
malia and it is this quality which 
renders the covert mediating responses 
relatively independent of environmental 
stimulation. " 

Watson's affirmation that these medi- 
ating responses are peripheral rather 
than central is of no crucial consequence 
to the present argument. The centrality- 
peripherality controversy which raged 
half a century ago has now subsided, not 
because the motor theory of thought has 
been disproved, but because neurolo- 
gizing in psychology has become generally 
acceptable. Yet, even today, we find 
theorists, such as Osgood, unwilling to 
commit themselves to either position. 
The nature and locus of “the little rm” 
remains unspecified (cf. Osgood, 1953, 
pp. 697-698; Osgood, Suci, & Tannen- 
baum, 1957, pp. 7-8). 

Skinner’s (1957) book, Verbal Behavior, 
is often cited as an example of the weak- 
ness of a simple S-R approach to lan- 
guage behavior and higher symbolic 
processes (e.g., Chomsky, 1959; Osgood, 
1958). The criticisms leveled against 
Skinner's so-called single-stage S-R 
theory center around the notion that 
symbolic activity, meaning, semantic 
generalization, and the like, require in- 
ferred mediating processes that are 
relatively independent of overt stimula- 
tion. Despite what Skinner may say 
about the methodological difficulties of 
theorizing about unobservables and his 
stated reluctance to do so, the important 


* Osgood (1961) has argued that the form 
of his representational mediation Process (rm) 
is nonverbal. In this respect, he differs from 
Watson or Bousfield, but not from Skinner, 
as will be shown below. 
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point to realize is that he does make 
extensive use of inferred mediating 
processes. Again, this statement needs 
justification, and two examples will be 
cited here to support the argument. 
The first involves Skinner's interpreta- 
tion of "meaning," while the second 
relates to his description and explanation 
of "semantic generalization.” 


MEANING AND SEMANTIC 
GENERALIZATION 


The description of meaning which 
Skinner gives is remarkably similar to 
Osgood's sign conditioning paradigm— 
remarkable only because of the distance 
which is supposed to separate these two 
theorists. According to Osgood (1953, 
p- 696), the meaning of the word spIDER 
is the representational mediation process 
(na) which becomes conditioned to the 
sign according to the classical Pavlovian 
conditioning paradigm: 


spider- 


S — Ry unconditioned 
object 


total reaction 


/ 
the sign, [S2r,-»s,—R x instrumental 


SPIDER response in- 


dependently 
conditioned 


In this diagram, we can distinguish 
Osgood's two stages, The first is the 
classical conditioning of a meaning re- 
sponse (rm) to the verbal symbol SPIDER- 
The second, independent of the first, 15 
an instrumentally conditioned response 
(Rx) to the self-stimulation (sm). An 


example of Ry is the overt verbal re- 
sponse ‘snake’, 


Now compare to this Skinner's (1957) 
following account: 


But when we get som 


eone to laugh by telling 
a funny story, 


s we use stimuli conditione 
according to the classical Pavlovian pattern. 
If a verbal stimulus frequently accompanies 
Some state of affairs which is the uncond!- 
tioned or previously conditioned stimulus for 
an emotional reaction, the verbal stimulus 
eventually evokes this reaction. Thus if ove 
is afraid of snakes, and if the verbal stimulus 
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"snake" has s i 

E os a sometimes accompanied real 
"i kes, the verbal stimulus alone may evoke 
an emotional reaction [p. 155] ! 


Ur aen symbols may acquire the 
RUE of evoking emotional reactions 
dnd E representative of uncondi- 
betes esponses not proper to the verbal 
nulus as such. Furthermore, the 
overt verbal response, "snake", is inde- 
pendent of these emotional reactions, as 
attested by the following remarks: 


The 4 er 

Es emotional reaction is usually a by-product 

ki some other verbal function. The verbal 

M ye does not establish the response 

o5 hes primarily to evoke such a reaction 
e part of the listener Skinnei 57 

p. 155]. f picis 


In short, both Osgood's and Skinner's 
accounts involve the notion of two 
adi pendant procene in meaning: clas- 
Pere y conditioned representational re- 
S to symbols, and instrumentally 
Perl ioned (operant) verbal responses. 
ia haps because Skinner does not use 
diagrams, these relations are less clearly 
perceived in his, as compared to Osgood's, 
account. 

With respect 
semantic generalizatio 
P. 155) cites Diven's 
demonstration of this phenomenon, while 
Osgood (1953, p. 701) describes in some- 
What greater detail a number of such 
experiments involving generalization of 
the galvanic skin response. According to 
Osgood, generalization of an overt re- 
sponse from one symbol to another physi- 
cally dissimilar symbol (e^ STYLE to 
FASHION) is mediated by the common- 
Meaning response shared by the two sym- 


bols. Skinner's (1957) account of the same 
Phenomenon, which he cites as an €* 
ample of “metaphorical extension is 
discussed in connection with the words 
that Shakespeare puts in Romeo's mouth: 

Juliet is the sun." His argument runs 


as follows: 


to the problem of 
n, Skinner (1957, 
(1937) classic 


on seems to occur 
y between stimuli expres- 
ysical science can be 
veral possible ex- 

have a common 


ne extensi 


demonstrated. There 
Planations. Two stimuli í 
effect upon the respon 


mediates the extension of the ri 

m example “Juliet is the sini" icis pos to 
that a physical similarity could ni E be 
plausibly established. Only to Rome: did 
Juliet glow with the light of the em. The 
metaphorical extension might have bei " 
mediated by, say, an emotional Wu ran 
mie the sun and Juliet evoked in him 


The reader can judge for himself on the 
similarity of this account by Skinner and 
an account which Osgood would give of 
the same event. 


THINKING 


Skinner has been accused of shunning 
the problem of higher mental processes 
and yet he has stated (p. 434) that AS 
account of verbal behavior would not be 
complete until the problem of thinking 
is discussed. One should not be dis- 
tracted by his warning that “verbal be- 
havior which cannot be observed by 
others is not properly part of our field 
p. 434]" when, in the same paragraph, 
he himself can disregard the warning as 


impractical : 
o avoid the problems it raises 
rselves to observable events, 
letting anyone extend the analysis to his own 
covert behavior who wishes to do so. But 
there would then be certain embarrassing 
gaps in our account. In intraverbal chaining, 
for example, necessary links are sometimes 
missing from the observable data. When 
someone solves à problem in "mental arith- 
he initial statement of the problem 
| overt answer can often be 
ly by inferring covert events. We 
t for verbal behavior which 
| of covert speech—which 
) or qualifies it with 


It is tempting t 
by confining ow 


related on 
also have to accoun 
js under the contro 


reports it (Chapter 5 
autoclitics (Chapter 12). Covert behavior 
has also had to be considered in discussing 


grammar (Chapter 13), sentence composition 
Chapter 15), and other 


(Chapter 14), editing ( ; ; T 
topics of Part V. Some discussion of its 
dimensions is therefore required [p. 434]. 
Skinner is not opposed to the logic of the 
centralist position : “physiological proc- 
esses mediate the probability of covert 
and overt responses alike . . + [p. 435]" 
ay not be inclined to 


even though he m k : 
make guesses about physiological medi- 


ators. He shares with Osgood, Maltz- 
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(E i hers 

an, Staats, N. E. Miller, and ot Ts, 
the belief that “the data UAM 
to the notion of covert spee «s 
dealt with the degree of rigor pres i g 
elsewhere in a science of verbal behavior 
at the present time [p. c € 
It was stated above that the property 
of self-stimulation is a crucial character- 
isti f inferred mediating. processes in 
ae it allows for relative independence 


of environmental stimulation. Accord- 
ing to Skinner: 
. the range of verbal behavior is roughly 


suggested, in descending order of energy, by 
shouting, loud talking, quiet talking, whisper- 
ing, muttering “under one's breath," sub- 
audible speech with detectable muscular 
action, subaudible speech of unclear dimen- 
sions, and perhaps even the “unconscious 
thinking” sometimes inferred in instances of 
problem solving (p. 438]. 


When it is realized that “subaudible 
speech of unclear dimensions” (neural?) 
may be “under the control of" or “medi- 
ated by" other elements of the same 
nature (ie., unmeasurables), it then 
becomes quite clear that Skinr 
count of thinking is as far remo 
the external environment as a 
avowed mediationists, 


is thus a central conce 
theory : 


ner’s ac- 
ved from 
ny of the 
Self-stimulation 
pt in Skinner's 


The speaker's own 
matically supplies stir 
or intraverbal behav: 
generate stimuli for fu 


Verbal behavior auto. 
muli for echoic, textual, 
tor, and these in turn 
rther responses [p. 4397. 


Since automatic reinforcement 
the speaker's own verbal behav 
respect the contingencies which 
external environment, co; 
can be "stretched" at will, beginning perhaps 
with a slight exaggeration but leading even. 
tually to fiction and lying [pp. 441-442), 


[provided by 
ior] need not 
Prevail in the 
ntrolling relations 


The importance of self-stimulation in 
Skinner's system is attested by the 
central role relegated to his concepts of 
"self-mand" (which functions as self- 
reinforcement), "'self-tact" (which serves 
a self-controlling function), and “covert 
self-editing” (verbal behavior under the 
Control of “private stimulation”), 
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It may be argued (c.g, Chomsky. 
1959) that the expression “under the 
control of private stimulation" repre- 
sents a gratuitous usage of the term 
"control." This criticism, whether valid 
or not, is irrelevant to the present argu- 
ment. Similar criticisms can be raised 
with respect to the circularity in the 
definition of S and R, the term, “reinforce- 
ment," the application of the concept of 
semantic generalization to the stylistic 
usage of metaphorical expressions, or to 
the use of the concept of response to refer 
to unspecified neural processes. Critic- 
isms raised by Chomsky (1959) in his re- 
view of Skinner's Verbal Behavior represent 
a linguist's indictment of all types of S-R 
models in language behavior, Watsonian, 
Osgoodian, and Skinnerian alike. They 
Will have to be dealt with elsewhere.' 
What is asserted here is that the Skinner- 
ian concept of mediated self-stimulation 
has the same theoretical characteristics 


as Osgood's representational mediation 
process, * 


The impression should 
here that Skinner's critics 
the inconsistency betwee 
objectivity and the use of 
mediated self-stimulation ; 
good (1958), commenting on Skinner's 
use of the “extended tact" to explain 
semantic generalization states: 


not be givea 
are unaware of 
n his claim to 
the concept of 
- Thus Os- 


To say that it occurs ' 
pristine objectivity of 
nates any re; 
and tw 


'covertly" cancels the 
the system and elimi- 
al distinction between single-stage 
"stage models, 


It must be emphasized, however, that there 


is nothing necessarily incompatible between 
What Skinner has done here and an explication 
of representational (symbolic) Processes. . . - 
. ‘ The seriousnes: 

indicated 


nt no major at- 
heorists has ap- 


_ Psychological literature to 
isms. 


"odor (1965), but 
based 9n an inadmissible reduction 
of Osgood's mediation theory to a fictional 
single-stage S-R paradigm, 
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Having agreed that there are implicit non- 
verbal processes in behavior, Skinner does not 
go on to explicate their nature and function. 
This is the major insufficiency of Verbal Be- 
havior . . . (p. 212]. 


Clearly, then, the charge is not one of 
incompatibility. 

Skinner has “always regarded the last 
few chapters in Verbal Behavior as the 
most important” and believes that “the 
possibility of analyzing thinking is one 
of the major contributions of the book." 
In view of this, it is ironical that his 
analysis of thinking (which appears as 
the last chapter in the book) has often 
been summarily dismissed as an after- 
thought inconsistent with his general 
system. 

6B, F. Skinner, personal communication, 


February 1965. 
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i NSOLIDATION AND ELEC 
Er SHOCK REEXAMINED 


MES L. McGAUGH ^x» LEWIS F. PETRINOVICH 
JA} : 


; i 7 State University of New York, 
A ia, 
T of A a pie D oh 


i 5 t the 

ent paper Lewis and Maher (1965) concluded tha : 
bee electroconvulsive shock (ECS) studies provide little 
support for the consolidation hypothesis, and they proposed the 


E 
During the past 2 decades, a large num- answer the latter question, It is unfor- 7 
ber of studies have shown that electro- tunate that they did not consider their 

convulsive shock (ECS) impairs reten- more general question, for there is a 

tion if the ECS treatment is administered rather substantial amount of data indi- 

within a short period of time after the cating that memory stor; 
original learning (Glickman, 1961; Mc- pendent (e.g. McGaugh, 1965; McGaugh 

Gaugh, 1965). The most generally ac- & Petrinovich, 1965). 
cepted interpretation of these findings is search has been publish 
that the ECS treatment interferes with lication of Glickman’ 
Processes involved in the consolidation or i 


Storage of memory traces, In a recent dation hypothesis, 


review Lewis and Maher (1965) at- On the basis of their analysis of some f* 
tempted to“... . take up the major ofthe ECS research literature, Lewis and 
experimental variables in which ECS Maher: ( 


i a) concluded that very little, if 
has been involved and to evaluate the any, of the ECS liter: 


data derived from their manipulation in- consolidati 
sofar as it pertains to consolidation [p. sented an 
226].” They also posed the question: ECS effects 
“Are there adequate data to substantiate paper we 
the notion that a period of time following literature reviewed by Lewis and Maher 
learning is necessary for an engram to isi olidation h oth 
be fixed [p. 226]?” These are, of course, esis; (b) contrary to th D 
two e user Tic Question of Lewis and Maher there 
whether the findings o have any amount of quite a i 
implications for consolidation theory is ECS affects ogo ees ey that 
one; the question of whether Consolidation i 

theory is supported by any adequate data 
is still another. In restricting -their re- 


ature supports the | 
pre- 
€rpretation of 
on behavior, In the present 
Point out that: (@) Much of the 


view to studies using ECS, Lewis and ogical groun ds. empirical and 
Maher could not, of course, attempt to x , ; 
NALY. : " 
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MH-10261, and MH-06096-0351 from the W or ECS Resrarcy 


National Institute of Mental Health, United 
States Public Health Service, and by Re. du 
search Grant GB1612 from the National Sci- 
ence Foundation. 


The analyses pres 


s ented in the intro- 
ction and review s, 


ections of the Lewis 
and Maher paper are, for the most part, 


irrelevant to the consolidation hypothesis, 
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For example, Lewis and Maher (1965, p. 
226) point out that much forgetting is 
caused by proactive inhibition and that 
this phenomenon is difficult to encompass 
within a consolidation framework. This 
is true. It is also irrelevant. It has not 
been claimed that interference with con- 
solidation is the sole cause of “forgetting” 
(i.e. failure to recall); it has only been 
claimed that it may be ome cause of 
forgetting. Consolidation theory is not 
only a hypothesis of failure to recall, it is 
also (and primarily) a hypothesis of 
memory storage. And, as such, it has not 
yet been completely abandoned even by 
researchers in verbal learning (e.g., Mad- 
sen, 1963; Walker, 1963). 

In the last section of their review, 
Lewis and Maher acknowledged that sev- 
eral experimenters (e.g. Hudspeth, Mc- 
Gaugh, & Thomson, 1964 2) have pointed 
out that the behavioral effects of a series 
of ECS treatments are quite different 
from those produced by a single ECS 
treatment. Retrograde amnesia can be 
produced with a single ECS treatment. 
Other effects (including aversive effects) 
are produced when ECS is administered 
repeatedly. In view of the fact that re- 
peated ECS treatments are known to 
produce both amnesia and other effects 
which influence behavior, such studies 
cannot provide the best test of the 
consolidation hypothesis. Unfortunately 
most of the experiments cited by Lewis 


and Maher involved the use of multiple 
It is clear that uncon- 
ncerning memory 
consolidation must be obtained from ex- 
periments in w ECS 


treatment is adm 
which the effects of the first and subse- 


quent treatments can be readily dis- 
tinguished). Since this problem was 
first clearly recognized, a large number 
of relevant studies have been published. 
In addition to those cited by Lewis and 
Maher (Chorover & Schiller, 1965; 
Heriot & Coleman, 1962; Hudspeth et al., 


2 [ncorrec 
pe 233) 
cited 


tly cited by Lewis and Maher 
as McGaugh and Hudspeth, 


in the references of their 


1964; Pearlman, Sharpless, & Jarvik, 
1961; Weissman, 1963), there are others 
(Chevalier, 1965; King, 1965; Leonard & 
Zavala, 1964; Madsen & McGaugh, 1961; 
McGaugh & Madsen, 1964; Quarter- 


main, Paolino, & Miller, 1965; Tenen, 
1965: Weissman, 1964). Since all 0 
these studies found clear evidence that 
retrograde amnesia is produced by a 
single ECS, Lewis and Maher’s (1965) 
conclusion, “It seems quite possible that 
the consolidation notion has not yet been 
tested at all [p. 237],” is obviously un- 
justified. 

Lewis and Maher further state that 
studies using more than one training trial 
are irrelevant to the consolidation hypoth- 
esis. We do not agree with this conclu- 
sion. On the contrary, the series of ex- 
periments by Thompson and his colleagues 
(e.g, Thompson & Dean, 1955; Thomp- 
son & Pennington, 1957) has shown that 
a single CS produces retrograde amnesia 
if the treatment is administered within a 
short period after animals are trained 
(with massed trials) to a relatively 
stringent criterion. And, as would be 
expected according to the consolidation 
hypothesis, the amount of amnesia pro- 
duced by à posttraining ECS treatment 
varied inversely with the length of the 
val used during the training 


(Thompson & Pennington, 1957). | 
this latter experiment, 


to ones invo| ing. em 
to the conclusion o Lewis and Maher, 


marce varia! 
if prem distribution of practice gen- 
erally results in erior retention as 
well as superior learni 
gt pen of findings of numerous 
studies using 2 single ECS treatment, 1 
is known that the length of a ne 
ing interval during hich an : reat 

ill produce some degree o amnesia 
mede a function of many 


vari derably 45 
variebies, including task complexity, de- 
gree 0 "training and age (Glickman, 
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is and Maher pointed out that 
Sele gon Schiller (1965). ure 
reported an experiment in which ECS 
duced retrograde amnesia only when 
T iministered within 4 minute after a 
training trial. The fact that memory 
storage with this task and conditions was 
somewhat more rapid than that obtained 
in other research is interesting, but it 
does not mean, as Lewis and Maher sug- 
gest, that this particular study “, . . is 
the only ECS study relevant to consoli- 
dation [p. 233]." Relevancy of experi- 
ments is determined by examining the 
appropriateness of the designs of experi- 
ments, not their outcomes, And, as we 
have indicated, numerous appropriately 
designed (i.e., relevant) studies have pro- 
vided extremely strong evidence that 
memory storage is time dependent, 
At the conclusion of the review section 
Lewis and Maher wrote: 


he Possibility of such 
tests may soon vanish [p, 233]. 

Thus they seem to b 
are any data requi 


In view of this, it is Puzzling why, in the 


Conprrionep-In gri Hvrornzsis 


The hypothesis Proposed by Lewis and 


Maher is essentially as follows, ECS is 
assumed to be an uncondition 


which produces an uncond 


re- 
Sponses.3 

*In an earlier version of this general 
hypothesis (Lewis & 


Adams, 1963) the 
amnesia was assumed to be due to condition. 


1 a convulsion to cues 
Present during ECS administration, 


Any new hypothesis or theory has two 
functions to perform. First, it should 
provide an explanation of existing data, 
and, if possible, it should resolve some 
seemingly conflicting observations. At 
the very least, it should not ignore avail- 
able data. Beyond that, any new hypothe- 
sis should have a heuristic role: It should 
suggest new experiments or new observa- 
tions, Unfortunately Lewis and Maher 
made little effort to show in any detail 
how the conditioned-inhibition hypothesis 
might explain the diverse findings avail- 
able at the tim 
Beyond that, the hypothesis fails com- 
pletely to explain at least one s 


Pothesis appears 
de explanations of a 


nhibition hypothesis 
offers no explanation of the differential 


effects of single and Tepeated ECS treat- 
ments. There is extremel i 


duced by a 


d single ECS treatment 
administered 


Shortly after learning. 
(eg. aversive effects and 


ered Tepeatedly (e.g., 


Hudspeth et al. 
1964; McGaugh & Madsen, 1964). It is 
h 


low the conditioned-in- 
S would explain these 


as induced [p, 236]." 
t in most ECS experi- 


e training apparatus, F 
cent direct tests 


of this hypothesis 
Leonard & Zavala, 1964; ( uartermain 
ct al, 1965) the “mount of retrograde 
amnesia Produced p 


i O explain in terms of the condi- 
Honed-inhibition hy 


- The conditioned-inhibition hypothesis 


_— 


> — X -_ 
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relies solely upon evidence from studies in 
which amnesia is indexed by increases in 
latencies. Failure to move or failure to 
move within a certain period of time is 
taken as evidence of amnesia in tasks re- 
quiring the animal to move in order to 
avoid a punishment or receive a reward. 
Thus, the findings of Duncan (1949) 
may be interpreted in this way. How- 
ever, in most (but not all) recent studies 
using ECS the animals were trained to 
inhibit a response in order to avoid a 
punishment. According to the condi- 
tioned-inhibition hypothesis, ECS should 
enhance inhibitory avoidance learning 
(so-called passive avoidance); that is, 
the latencies should increase following an 
ECS treatment. A single ECS adminis- 
tered in the experimental apparatus does 
not produce increased latencies. How- 
ever, a single posttrial ECS treatment 
even when administered outside of the 
apparatus clearly impairs inhibitory 
avoidance learning (e.g. Heriot & Cole- 
man, 1962; Hudspeth et al., 1964; King, 
1965; Madsen & McGaugh, 1961). In 
addition, there is extensive evidence 
(e.g, Glickman, 1961) that ECS pro- 
duces amnesia in discrimination-learning 
tasks where differential responding rather 
than latency is used to measure learning 
and memory. Any hypothesis concerning 
the amnesic affects of ECS must consider 
these findings. 

4. According to the conditioned-inhibi- 
tion hypothesis, posttrial ECS treatments 
should, by inducing relaxation, facilitate 
extinction. However, ECS treatments 
have been found to impair extinction of 
both approach and avoidance responses 
(Gerbrant & Thomson, 1964; Madsen & 
Luttges, 1963). 

5. Finally, Lewis and Maher (1965) 
concluded that: "Any procedure such as 
anesthesia that renders the animal un- 
conscious before passage of current will 
prevent the establishment of an ECS- 
produced inhibition . . . [p. 237]." Am- 
nesia would thus not be expected with 
h an experimental procedure. As 
Maher pointed out, anesthetics 
1 to prevent the occurrence of 

produced by electroshock 

and the evidence does indi- 


suc 
Lewis and 
can be usec 
convulsions 
stimulation, 


&e 
ao 
I 


Ne Poot nt at 


Cer dra ECS ne 


aco IU acs as 
ve E Be 
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TRUATMENTS. 


Fic. 1. Effects of posttrial treatments of 
ECS, ether anesthesia, and ECS delivered 
during ether anesthesia on one-trial avoid- 
ance learning in mice. (From McGaugh & 
Alpern, 1966. See text for details.) 


cate that the effects produced by a 
series of ECS treatments are not found 
when the electroshock stimulation is ad- 
ministered while the animals are anesthe- 
tized (e.g., Friedman, 1953). However, 
the results of recent experiments clearly 
indicate that the retrograde amnesia pro- 
duced by a single ECS is obtained even 
when the electroshock stimulation is de- 
livered while the animals are anesthetized. 
The results of one study (McGaugh & 
Alpern, 1966) are shown in Figure 1. 
The left side of Figure 1 shows the per- 
formance on a second trial of mice 
given a foot shock on a previous day as 
they stepped from a small platform 
through a small hole into a darkened box. 
The performance of mice given an ECS 
while lightly anesthetized with ether (re- 
sulting in no convulsion) was comparable 
to that of animals given a normal ECS 
treatment. Both groups showed less 
retention than groups given the same 
treatments 1 hour after the trial The 
amnesia found with ECS appears to be 
due to the electrical current rather than 
to the convulsions. Control groups 
shown on the right side of F igure 1 indi- 
cate that when no foot shock is given on 
the first trial, the treatments of ether, 
ECS, and ECS under anesthesia do not 
affect retention. Comparable results with 
rats and with other anticonvulsants have 
been obtained by Weissman (1965). 
Thus, this evidence indicates quite clearly 
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i not, as Lewis and 
Se retrograde 
amada. Anesthesia does, however, seem 
to attenuate the pseudoamnesia as br = 
the aversive effects produced by multiple 
ECS treatments (Friedman, 1953). 


CONCLUSION 


ree with Lewis and Maher's 
(1968) pa that: “The effects of 
ECS are undoubtedly very complicated 
[p. 237].” However, it seems highly un- 
likely that the conditioned-inhibition 
hypothesis produced by Lewis and Maher 
will be shown to be capable of explaining 
the diverse effects of ECS on behavior. 
In particular the conditioned-inhibition 
hypothesis seems unable to account for 
the details of studies of the amnesic ef- 
fects of ECS. The conditioned-inhibition 
hypothesis, although perhaps superficially 
appealing, is clearly inadequate on both 
logical and empirical grounds. Studies of 
the effects of ECS on learning continue to 
Provide support for the hypothesis that 
memory storage is time dependent. Thus, 
the consolidation hypothesis not only 
".. . remains appealing" (p. 237), it re- 
mains supported. Tt does not appear that 


the conditioned-inhibition hypothesis will 
fare as well, 
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Retrograde amnesia due to electroconyulsiye 
memory or response deficit for events that hay 


prior to the convulsion, 


S of man 
pothesis are pointed out. A cond: 
amnesic effect of ECS is also def 


Some of the questions that have been 
raised by McGaugh and Petrinovich 
(1966) require careful answer. Others 
may be disposed of rather briefly. We 
shall consider the latter group first and 
save the main bulk of the space at our 
disposal to examine the central issues. 

First, we must draw attention to the 
fact that by separating two sentences that 
appeared together as a continuous propo- 
sition in the original paper (Lewis & 
Maher, 1965) our critics have managed 
to convince themselves that we fail to 
appreciate that the consolidation hypothe- 
sis might draw support from studies un- 
related to the electroconvulsive shock 
(ECS) technique. At the risk of tedious 
repetition, we would like to repeat the 
original statement as it was written: 


Our approach will be to take up the major 
experimental variables in which ECS has 
been involved and to evaluate the data de- 
rived from their manipulation insofar as it 
pertains to consolidation. Are there adequate 
data to sbstunsete the notion that a neon 

ime following learning is necessary for 
= un to become fixed [p. 226]? 


These sentences, considered separately, 
might have referred to two separate ques- 
tions. In the context and Sequence in 
which we wrote them, it is quite clear 


i ; by 
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shock (ECS) is a 


paper Lewis and 
s, which have so 
on point of view, 
that paper are answered, 


y of the single ECS studies for this hy- 


itioned-inhibition hypothesis for the 
ended. 


that they refer to one question—namely, 
the relationship of ECS data to the prob- 
lem of the neural consolidation hypothesis. 

The second matter is their disagree- 
ment with our statement that the evidence 
is quite strong that the effect of massed 
trials is due to performance rather than 
learning variables. Legitimate differ- 
ences of opinion are possible, but not fre- 
quent, on this topic. and we believe the 
better evidence and opinion support our 
point of view. In discussing this matter, 
Kimble (1961, P. 125) says: 


The evidence that what is involved is a 


performance factor comes from a variety of 
demonstrations that the effect of massed 
practice is temporary and that the decre- 
mental process dissipates spontaneously with 
rest. 


In any case, faith in the effects of trial 
distribution is a Shaky peg on which to 
hang the consolidation point of view. 
Third, McGaugh and Petrinovich raise 
a question concerning the adequacy of 
our coverage of the ECS literature. It is 
generally difficult for writers to cite 
papers that have not yet appeared in print. 
For McGaugh and Petrinovich to state 
that we did not consider the studies of 
Chevailier (1965), King (1965), Quar- 
termain, Paolino, and Miller (1965), 
and Tenen (1965) is quite accurate. For 
them to complain about it and to add that 
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therefore our conclusions were “obviously 
unjustified” is simply silly. 

McGaugh and Petrinovich want us to 
believe that the existence of retrograde 
amnesia itself must be taken as support- 
ing a consolidation notion. We must 
make it clear that the question at issue is 
not whether ECS produces retrograde 
amnesia—if by this is meant impairment 
in performance of a previously learned 
response. The data are generally very 
clear on this point; ECS does tend to 
lower—at times to zero—írequencies of 
responses that have been acquired at some 
brief interval before the convulsion. The 
question is not the reality of the be- 
havioral deficit; it is the question of an 
adequate theoretical account of its de- 
terminants. Thus the claim that "all 
these studies found clear evidence that 
retrograde amnesia is produced by a 
single ECS [p. 383]” is misleading. 
These studies do demonstrate a perform- 
ance impairment, but they are not con- 
clusive to the more precise issue of 
whether the performance impairment 
(retrograde amnesia) is a consequence of 
disruption of engram consolidation, or 
whether it is due to interference by some 
competing response. 

McGaugh and Petrinovich refer to the 
more recent studies using only one ECS 
as if the conclusions they present are 
somehow unequivocally in favor of the 
consolidation hypothesis. This is by no 
means the case. Tenen (1965) studied 
the effects of single ECS trials on rats 
that had previously approached a water- 
hole while thirsty and received reinforce- 
ment there. Two ECS intervals were 
used, 12 seconds and 3 hours. Examina- 
tion of the rate of responding showed that 
ECS, predictably, was followed by a 
diminution of responding compared to 
groups that did not receive ECS. How- 
ever, the differences between the 12-sec- 
ond and 3-hour groups were not signifi- 
cant—a finding that prompted Tenen to 
suggest that “Gt seems possible that ECS 
might exert some limited retrograde am- 
nesic effects even three hours after rein- 
forcement [p. 2].” But it also appears 
from examination of graphic data pre- 
sented in this study that the amnesia 
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found in the delayed group was not dif- 
ferent from that exhibited by a group of 
rats given simple painful foot shock in 
lieu of ECS. Chorover and Schiller 
(1965a) also criticized the consolidation 
interpretation given to the delayed group. 
The data hardly constitute univocal sup- 
port for an engram consolidation hypoth- 
esis. 

A very large proportion of the experi- 
ments using one ECS employ a situation 
in which a rat is placed on a raised plat- 
form. When he steps off, he receives a 
foot shock that causes him to remain on 
the platform when returned there for 
the next trial. If ECS follows the foot 
shock, however. the rat seems to forget 
the punishment and steps off the platform 
directly on the next trial. Thus, memory 
of punishment has apparently been dis- 
rupted, and interference with consolida- 
tion is the typical guess" as to the 
mechanism for the disruption. But this 
conclusion is far from demanded by the 
data.  Routtenberg and Koy (1965) 
showed that one ECS will increase the 
tendency of a rat to step off a platform 
independent of any avoidance training’ 
associated with it and specifically called 
for a reevaluation of the previous litera- 
ture (p. 288). Chorover and Schiller 
(1966) showed that a single ECS, as ad- 
ministered in typical consolidation studies. 
will produce a relative increase 1m loco- 
motor activity. This increase 1n activity 
results in a marked reduction in avoid- 
ance and leads to the frequent interpreta- 
tion that an engram has failed to con- 
solidate. The effect of ECS in these 
situations is indeed time dependent; the 
question is whether consolidation is a 
volved. We tend to think that th 
evidence does not warrant the conclusion 
that it is. 

]IvPOTHESI!S 

o the conditioned-in- 
hibition hypothesis. It is not d ren 
why McGaugh and pe: a T s 
hypothesis even “superficially a i a6 dt 
as it is quite apparent that they do 
understand it Zhe purpose of the 


is i the explanation 
hypothesis is to 
of ECS data, whether OT not the ECS 


INHIBITION 


Let us now turn t 


attempt 
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studies have been conducted in a manner 
germane to the consolidation theory of 
memory. Our critics accept that multiple 
ECS studies are not very helpful in test- 
ing consolidation notions. We agree. 
However, this does not mean that the 
consequences of multiple ECS treatment 
are not of psychological interest them- 
selves. The amnesic phenomena. pro- 
duced by multiple ECS are just as “real 

as those produced by one ECS, but they 
cannot readily be accounted for in terms 
of engram destruction as many have re- 
cently believed (Thomson, McGaugh, 
Smith, Hudspeth, & Westbrook, 1961). 

McGaugh and Petrinovich are worried 
by the failure of the inhibition hypothesis 
to explain the differential effects of single 
and repeated ECS treatments. They be- 
lieve that effects such as fear are found 
only when ECS is administered repeat- 
edly. Reference to King (1965), cited by 
them, shows that such effects are in fact 
obtained by a single ECS treatment, al- 
though King also states that fear pro- 
duced by a single ECS cannot be the sole 
explanation for the amnesia. 

The Quartermain et al. (1965) study is 
also cited by McGaugh and Petrinovich 
as negative to our interpretation. In our 
paper we said that there was only one 
study (Chorover & Schiller, 1965b) in 
the literature with a learning-ECS inter- 
val sufficiently short to test the consolida- 
tion notion. Now we will amend that 
statement to include the Quartermain 
et al. (1965) report. Jf the consolida- 
tion interpretation has any support from 
the ECS literature, it comes from these 
two studies, and our objections to the 
others still stand. 

Another criticism is perhaps more baf- 
fling. McGaugh and Petrinovich state 
that: "the conditioned-inhibition hypothe- 
sis relies solely upon evidence from 
studies in which amnesia is indexed by 
increases in latencies [p. 384].” This 
could mean that the studies mentioned in 
our original paper (Lewis & Maher, 
1965) under the specific heading, “A 
Conditioned Inhibition Hypothesis,” in- 
clude only such studies. A careful 
scrutiny of pages 234—237 of our paper 
shows this not to be the case. Alterna- 
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tively, perhaps McGaugh and Petrinovich 
mean that the whole paper rests only on 
such studies. This, clearly, is not the 
case either as the citations to the work of 
Brady and his colleagues point specifically 
to the use of increases in rate of bar press 
as measures of ECS effects. The study 
by Williams (1961) points to the effects 
of ECS in reducing hesitations in a run- 
ning response. Many other kinds of re- 
sponse are included in our review. As 
the criticism is somewhat unclear, we 
might venture to suggest what the prob- 
lem is. McGaugh and Petrinovich in- 
correctly deduce that conditioned inhibi- 
tion should enhance inhibitory avoidance 
learning. Inhibitory avoidance learning 
—of which freezing in the conditioned 
emotional response pattern is the exem- 
plar—involves the maintenance of con- 
siderable muscular tension in a highly 
aroused animal. Under conditions where 
the fear is gradually reduced—as by nor- 
mal extinction processes—the reduction 
in arousal is evidenced by an increase in 
the general locomotor activity, Thus, 
the animal that is in a lowered state of 
arousal should be less likely to “freeze”— 
more likely to move around—than one 
that is in a highly fearful state. The 
effect of the conditioned inhibition, we 
hypothesize, is to lower arousal levels, 
not to increase those motor inhibitory 
patterns that are invoked in Passive avoid- 
ance situations. Ina word, the difference 
is between the implications of protective 
inhibition (a state on the general arousal- 
inhibitory continuum) and the inhibition 
of specific motor responses, a matter that 
depends, of course, upon the pattern of 
muscular innervation that a given stimu- 
lus may excite. It is a pity that this 
confusion could not have been avoided 
by our critics. We had anticipated that 
the clear reference to protective inhibition 
(Lewis & Maher, p. 234) would have 
obviated this problem, However, it may 
help at this point to reiterate that under 
Some conditions the shift from high 
arousal to lower states may be accom- 
plished by an increase in locomotor activ- 
ity where the locomotor activity was 
Previously being inhibited as part of the 
high-arousal pattern. Thus the hypoth- 
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esis does not depend upon increased 
latencies only; it may also be used to 
account for the reduction in the latency 
of a response which had been suppressed 
by punishment. 

Contrary to the simplified deductions of 
McGaugh and Petrinovich, the condi- 
tioned-inhibition hypothesis does not in- 
evitably compel the prediction that post- 
trial ECS treatments will facilitate 
extinction. Predictions cannot be made 
in the absence of specific information 
about the levels of arousal appropriate to 
the production of the response and the 
processes involved in extinction. For 
example, to the extent that extinction 
depends upon the occurrence of the re- 
sponse, this being followed by nonrein- 
forcement, then anything that depresses 
the rate of response may serve to slow 
the occurrence of extinction trials, their 
spacing, etc. 

Finally, we should note that the brief 
account of an experiment using ether 
anesthesia. permits us to do no more 
than raise some questions. One crucial 
question relates to the interval between 
the occurrence of the foot shock and the 
administration of ether. From the de- 
scription we must assume that the animals 
in the Ether-ECS/25 seconds condition 
were anesthetized at some posttrial inter- 
val of less than 25 seconds. Thus the 
unconscious state was presumably induced 
fairly rapidly after foot shock. The find- 
ing that this produces retrograde amnesia 
is in line with the work of Pearlman, 
Sharpless, and Jarvik (1961), cited in 
the original review. As we have already 
pointed out (Lewis & Maher, 1965, p. 
236), ". . . if the unconsciousness in- 
duced by the anesthetic is rapid, and done 
in the experimental situation, then it 
should have the same retrograde amnesic 
effects that ECS alone does." We have 
not advanced the view that the convulsion 
is necessary to produce the conditioned- 
inhibition paradigm—simply that rapid 
induction of unconsciousness is necessary. 

We do have some points of agreement 
with McGaugh and Petrinovich. We 
agree that memory (and hence learning) 
must depend upon physiological storage 


processes, and that these will have some 


time-bound characteristics. We agrec 
that multiple ECS treatments are a poor 
way to test this notion. We agree that 
there are many determinants of forgetting 
other than simple response competition, 
just as there are many ways to account 
ior performance impairment besides the 
assumption that an engram has been 
destroyed. 

We disagree with them on many other 
counts. They are unhappy, for example, 
with conditioned inhibition as an ex- 
planation of ECS effects. In the case of 
multiple ECS treatments they are no 
longer offering the consolidation hypoth- 
esis as an alternate (e.g. see McGaugh, 
Westbrook, & Thomson, 1962, who dis- 
cuss multiple ECS studies as supporting 
"perseveration-consolidation" ). Theyare 
not, in fact, offering any alternative. 
Until they, or others, do we must point 
out that the inhibition hypothesis does 
explain an impressive bulk of experi- 
mental data and resolves many of the 
contradictions that arose when these ex- 
periments were interpreted in engram 
terms. 

McGaugh and Petrinovich feel that we 
have been unaware of the results of single 
ECS studies. Examining those studies 
they have cited, we find that they do 
report performance impairment ; they do 
not provide experimentally crucial evi- 
dence for the consolidation hypothesis. 
The studies are generally compatible 
with the conditioned-inhibition hypothesis. 

Under the circumstances we shall con- 
tinue to explore the conditioned-inhibition 
hypothesis with every expectation that 
empirical data will provide the corrective 
amendments that hasty criticism does not. 


CONCLUSION 

cle we reviewed the 
for about 20 years, 
ery few exceptions, 
ion of neural con- 
ued that the litera- 


In our previous arti 
ECS literature which, 
had been taken, with v 
as confirming the noti 
solidation. It was arg t 
ture was largely irrelevant to the issue 
and that only the very few experiments 
using one could be considered as 

hese experiments, only 


relevant. Among th 
‘as (Chorover & Schiller, 1965b) had 
worked with a sufficiently short interval 
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between learning and convulsion to be 
clearly within a possible consolidation 
period. This one study had found no 
behavioral impairment beyond a 10-sec- 
ond interval. This interval has since been 
confirmed by a second experiment, and 
serious doubt remains that the impairment 
found in other experiments using one 
ECS is due to interference with a consoli- 
dating engram. In the present paper we 
have defended our point of view from 
some hasty criticism, pointed out some of 
the reasons why most single ECS studies 
are suspect, and defended our hypothesis 
of conditioned inhibition for the retro- 
grade amnesic effects of ECS. 


REFERENCES 


CHEVALIER, J. A. Permanence of amnesia 
after a single posttrial electroconvulsive 
seizure. Journal of Comparative and 
Physiological Psychology, 1965, 59, 125- 
127. 

Cuonovgn, S. L., & ScHiLLER, P. H. Retro- 
grade amnesia. Science, 1965, 129, 
1521. (a) 

Cuorover, S. L., & Scuitter, P. H. Short- 
term retrograde amnesia in rats. Journal 
of Comparative and Physiological Psychol- 
ogy, 1965, 59, 73-78. (b) 

Cnuonovgg, S. L., & Scuter, P. H. Re- 
examination of prolonged retrograde am- 
nesia in one-trial learning. Journal of 
Comparative and Physiological Psychol- 
ogy, 1966, 61, 34-41. 

Kimete, G. A. Hilgard and Marquis con- 
ditioning and learning. New York: Apple- 
ton-Century-Crofts, 1961. 

Kine, R. A. Consolidation of the neural 
trace in memory: Investigation with one- 
trial avoidance conditioning and ECS. 
Journal of Comparative and Physiological 
Psychology, 1965, 59, 283-284. 


Lewis, D. J., & Mauer, B. A. Neural con- 
solidation and electroconvulsive shock. 
Psychological Review, 1965, 72, 225-239. 

McGaucH, J. L, & Perrinovicu, L. F. 
Neural consolidation and electroconvulsive 
shock reexamined. Psychological Review, 
1966, 73, 382-387. 

McGaueu, J. L., Westproox, W. H., & 
Tuomson, C. W. Facilitation of maze 
learning with  posttrial injections of 
5-7-diphenyl-1-3-diazadamanton-6-07 (1757 
LS.). Journal of Comparative and Physio- 
logical Psycholoyy, 1962, 55, 710-713. 

PEARLMAN, C. A., JR, SHARPLESS, S. K,& 
Jarvix, M. E. Retrograde amnesia pro- 
duced by anesthetic and convulsant agents. 
Journal of Comparative and Physiological 
Psychology, 1961, 54, 109-112. 

QuanrERMAIN, D., Paotino, R. M., & Mil- 
LER, N. E. A brief temporal gradient of 
retrograde amnesia independent of situa- 
tional change. Science, 1965, 149, 1116- 
1118. 

RourrENBERG, A, & Koy, K. E. Effect of 
one electroconvulsive seizure on rat be- 
havior. Journal of Comparative and 
Physiological Psychology, 1965, 59, 285- 
288. 

TENEN, S. S. Retrograde amnesia from 
electroconvulsive shock in a one-trial ap- 
petitive learning task. Science, 1965, 148, 
1248-1250. 

Tuomson, C. W., McGavan, J. L., SMITH, 
C. E, Hupsreru, W. J., & WESTBROOK, 
W. H. Strain differences in the retro- 
grade effects of electroconvulsive shock on 
maze learning. Canadian Journal of Psy- 
chology, 1961, 15, 69-74, 

WirLiAMs, G. J. The effect of electrocon- 
vulsive shock on an instrumental condi- 
tioned emotional response ("conflict"). 
Journal of Comparative and Physiological 
Psychology, 1961, 54, 633-637. 


(Received December 20, 1965) 


SEPTEMBER 1966 


PSYCHOLOGICAL REVIEW 


A MARKOV MODEL FOR CLASSICAL CONDITIONING: 
APPLICATION TO EYE-BLINK CONDITIONING 
IN RABBITS! 


JOHN THEIOS AND JOHN W. BRELSFORD, JR 


University of Texas 


A 3-state Markov model is postulated for classical conditioning. Specific 
implications of the model for ev blink conditioning are discussed. The 
Markov model, à general response-independent model, and a single- 
operator linear model with individual differences were applied to eye- 
blink conditioning data of rabbits. Likelihood ratio analyses of indi- 
vidual response sequences indicate that the Markov model is f r superior 
to the other models as à description of the conditioning process in 
rabbits. Speculations concerning theoretical identifications of the 
abstract states of the Markov process are made. Application of the 
Markov model to human eyelid conditioning is discussed. 


Noxious stimulation of the eye of 1962). Their research indicates that 
the rabbit results in a reflex response paired presentations of a conditioned 
which is characterized by retraction stimulus (CS), for example, a tone: 
of the eyeball into its orbit, extension andan unconditioned stimulus (UCS), 
of the nictitating membrane partially for example, an air puff to the eye, 
across the cornea, and closure of the result in an increase in the probability 
outer eyelids. The work of Gormezano of an eye blink to the CS. However, 


and his associates has shown that each presentations of the CS alone, UCS 
of these three components of the alone, OF unpaired pres 
rabbit eye-blink reflex may be reliably CS and UCS, result in no systematic 
conditioned to a neutral stimulus and 
that the three response components 
are very highly correlated (Deaux & The purpose of en 
Gormezano, 1963; Gormezano, Schnei- develop a theoretical description of 
derman, Deaux, & Fuentes, 1962; the changes which take p 
Schneiderman, Fuentes, & Gormezano, trial-by-trial probability of response 
X. vndis an eye- 
T ual ra 3 
1 This research was Su ported by PHS hes individ al : ex eriment. Ex- 
Research Grants HD 00950-01, 02, 03 from blink conditioning Pl membrane 
the National Institute of Child Health and nictitating MO 
Human Development Public Health Service. was classically conditioned in 1 
We wish to express 9 i 4 ‘method and procedure 
Rodney W- McGinnis for developing the į ; : relsford and Theios 
computer programs for obtaining the likeli- i described m = from 70 of the 101 
hood values reported in this paper- (1965), where da TI biect 
? Now at Stanford University. rabbits are presente 5 he subjects 


tension of the 
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(Ss) were given 150 trials of condi- 
tioning in a single session with a 
-25-second CS-UCS interval and a 
constant 11i-second intertrial inter- 
val. The CS was an 88-decibel buzzer, 
and the UCS was a single 1-milli- 
second spike of electric shock, de- 
livered across the rabbit's right eye. 
The external eyelids of the rabbit were 
not clamped open. Thus, the re- 
sponse involved extension of the 
nictitating membrane, closure of the 
outer eyelids, and retraction of the 
eyeball. 

Under these conditions, the initial 
probability of a response to the CS 
in the .25-second observation interval 
after CS onset is effectively zero. 
After conditioning, however, rabbits 
have an asymptotic probability of 
about .95 of blinking to the CS in the 
-25 second interval before the UCS on- 
set. Operationally, a response to the 
CS which occurs during the observa- 
tion interval will be designated a con- 
ditioned response (CR). Justasin the 
experiments by Gormezano and his 
associates, the group mean probability 
of a CR increases as a slightly S- 


shaped function of the number of 
paired CS-UCS trials. 


RESPONSE-INDEPENDENT MODELS 


In situations such as classical con- 
ditioning, in which the stimulus 
events are presented on every trial, 
there are two major approaches which 
may be taken with respect to the 
construction of learning models. The 
first approach is to assume that the 
theoretical response probabilities are 
described by the ordered set {p,, 
f. sad az bu}, where pa rep- 
resents the probability of a CR on any 
Trial z, for 2 2 1,2, .. ., M. With 
this type of model, the response given 
on Trial z depends only upon the 
value of 5, and is independent of the 
sequence of responses given on trials 
prior to n. Models having this inde- 
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pendence property include the single- 
Operator linear model of Bush and 
Mosteller (1955) and Estes (1950), 
the single-operator beta model of Luce 
(1959), and the response-strength 
model of Spence (1956). In the first 
stages of model construction it is 
often convenient to assume that the 


Ss are homogeneous with respect to ` 


the values of the parameters of the 
model. Assuming homogeneity of 
parameters, the single-operator linear 
model predicts that the probability 
of a CR for individual Ss as well as for 
the group should be an exponentially 
increasing function of trials. Since 
the obtained group mean functions for 
rabbit eye-blink conditioning are typ- 
ically S shaped (Schneiderman et al., 
1962), the assumption of homogeneity 
of parameters cannot be made if one 
is to accurately predict the mean 
learning curve by the single-operator 
linear model. The homogeneity as- 
sumption can be made with respect 
to the strength models of Luce (1959) 
and Spence (1956) since these models 
permit S-shaped conditioning curves 
for individual Ss. 


Marxov MopeL 


As an alternative to the response- 
independent models, it may be postu- 
lated that response probability on 
Trial z is a function of the sequence of 
responses which occurred on the 
previous trials. Because the class of 
models having this dependence prop- 
erty is so large, our consideration will 
be restricted to the specific response- 
dependent Markov model defined by 
the following transition matrix, re- 
Sponse vector, and starting vector: 


Pr Pr 
C A NN (CR) (start) 
é 10 9 Be 0 
A |; l=< 0 E ] |? j 
N Ü @ tga Py 1 
[1] 
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The rows of the transition matrix 
represent possible states of respon- 
siveness on any Trial z of an experi- 
ment, and the columns represent 
possible states on Trial 2 4-1. The 


cells of the matrix give the prob- 


ability of S moving from one given 
state to another during the intertrial 
interval. The entries in the first 
vector give the probability of a re- 
sponse during the observation inter- 
val for each of the three states of 
responsiveness. The entries in the 
second vector give the probability of 
an Sostarting the experiment in a 
particular state. 

It is assumed that S starts out in 
an initial, naive state (JN), in which 
the probability of a response to the 
CS is Py. With rabbits, Py would 
not be a free parameter, but would 
take on the a priori value of zero. 
Following each application of the 
UCS, there is a fixed probability a 
that S becomes activated, aroused, or 
sensitized. This is represented by 
the process moving from State JV to 
State A. When S is activated, the 
CS may evoke a response with a 
fixed probability P4. After S has 
become activated, there is a fixed 
probability c that the response be- 
comes conditioned to the CS. This 
is represented by the process moving 
from State A to State C. When con- 
ditioning has occurred, the probability 
is quite high that the CS evokes a 
blink before the onset of the UCS. 
This probability is represented by Pc. 
Thus, the model predicts that there 
are three discrete levels of perform- 
ance (Py, Pa, and Pc) with an all-or- 
none transition from one level to the 


next. 
MATHEMATICAL ANALYSIS OF THE 
MARKOV MODEL 


In order to make predictions from 
the Markov model, it will be necessary 


to specify the probability that S is in 
a given state on Trial z of an experi- 
ment, where » = 1, 2, 3, .. 5 M. 
Let w;,n represent the probability that 
S is in State 7 at the beginning of 
Trial z, where i = N, A, C. From 
the transition matrix in Equation 1 
it can be seen immediately that 


Wyn = (1 a)". [2] 


For an S to be in State A on Trial 7 
he must have remained in State N for 
k trials ( 21, 2,..4% — 1) be- 
fore moving to State A, where he 
remained for (z—E—1) trials. Thus, 
the probability of being in State A 
at the beginning of Trial z is 


WAn= Y (1 —a)*7a(1—0)77*7. [3] 


kel 


The solution to the sum in Equation 
3 is 
waa = alc — a)! 


x[ü—-om--971 [4] 


The probability that Sis absorbed in 
the conditioned state, C, by Trial n 
may be obtained by subtraction as 


Wen = d — Wyn — Wan [5] 


The theoretical probability of a CR 
as a function of trials will be a 
weighted sum of the probabilities of a 
CR in each of the three states, where 
the weights are the probabilities of 
being in each of the states on Trial 2, 


Pr(CRn) = Pxwww 
HR) Fe i e 18 


It can be seen from Equations 2, 4, 


and 5 that, as the number of trials 


becomes large, WN,» and a,» ap- 


proach limits of zero, and We,n ap- 


proaches a limit of unity. 'Thus, 


Lim Pr(CR&) = Pe. [7] 


n=% 
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Probabilities of Response Sequences 
over Trials n ton + 3 


The trial-by-trial sequential struc- 
ture of the data may be assessed by 
considering sequences of ordered 4- 
tuples of responses for individual 
rabbits. Define a random variable 
X, which indicates the response made 
by an S on Trial z: 


0 
X, = 
1 


Over Trials n to n + 3 there are 16 
possible response outcome sequences 
of the form Osn = (X, Xnar, Xanga 


if CR on Trial z, 
[8] 


otherwise. 


Xna} for s = 1,2,..., 16: 

Oin — [0000] Osn = 11000] 

Ozn = [0001] Oy, = {1001} 

05, = 10010] On, = {1010} 

Osn = {0011} 05, = {1011} 

Osn = {0100} O12, = {1100} [9] 
Oem = [0101] 01, = [1101] 

Orn = {0110} O15, = {1110} 

Osn = {0111} Orsa = {1111}. 


Given that an Sisina particular state 
i on Trial n, i = N, A, C, the condi- 
tional probability of a particular re- 
sponse sequence 0,,, may be repre- 
sented by Pr(0.|¿) which is inde- 
pendent of the trial number x. The 
(16X3) 248 expressions for Pr(0,|i) 
may be written down in a straight- 
forward manner from an examination 
of the transition matrix and response- 
probability vector in Equation 1, 
They are listed in the Appendix. 
The unconditional probability of a 
particular response sequence begin- 
ning on Trial n is a weighted sum of 
the three conditional probabilities 
represented by Pr(0,|2), 


Pr(0,,) = sey,,Pr(0,]N) 
+ wa s Pr(0,|A) 


+ wemPr(0.|C). [10] 
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EVALUATION OF THE MODELS 


In evaluating the Markov model, 
two approaches are available. First 
it can be assumed that Ss are homo- 
geneous with respect to the values of 
the parameters of the model. Esti- 
mates of Py may be obtained cither 
from a pretest period in which Ss are 
given trials of CS alone or from the 
observed proportion of responses on 
Trial 1 of the experiment. An esti- 
mate of Pe may be taken as the 
asymptotic proportion of CRs, given, 
that a sufficiently large number of 
trials has been run. Estimates of the 
values of the remaining parameters 
may be obtained by a minimum x* 
procedure. Let M represent the 
total number of trials given in an 
experiment, T the total number of Ss 
in the experiment, and F(0,,) the 
observed frequency of Ss who have 
response sequence 0,,, beginning on 
Trial z. Then, 


M—3 16 


L= X XLT-Pr(0,,)— FO) F 


n=l al 
XEL Pr(0,,) TE [11] 


The Pr(0,,), and thus x?, are func- 
tions of the parameters Py, Oy Pas d 
and Pe. Since estimates of Py and 
Pe are directly observable, we need 
only select values of a, Pa, and c 
which minimize the X. The minimi- 
zation can easily be accomplished 
with a high-speed computer pro- 
grammed to search for optimum pa- 
rameter values. ‘This minimum x? 
estimation technique and its proper- 
ties have been discussed by Atkinson 
and Crothers ( 1964). 

It should be noted that the x? de- 
fined in Equation 11 is not to be inter- 
preted as the x? as used in hypothesis 
testing. Independence of observa- 
tions has not been preserved in 
Equation 11. Each S in an experi- 
ment contributes 4/—3 observations 
to the x?. Since the assumption of 


MARKOV MODEL ror CLASSICAL CONDITIONING 397 


independence of observations has 
been violated, it cannot be guaranteed 
that the nonindependent x? will have 
the same sampling distribution as the 
independent x*. Thus, one cannot 
make a probability statement about 
a particular value of a nonindependent 
x? unless its sampling distribution has 
been determined. The nonindepend- 
ent x? is, however, an extremely useful 
statistic for parameter estimation. 
It may be interpreted as an overall 
measure of the weighted squared 
error between a set of theoretical 
predictions and a set of empirical 
data. The squared error of each 
predicted point is weighted by the 
reciprocal of the theoretically ex- 
pected number of observations at that 
point. With respect to theoretical 
models, the model which predicts 
the least weighted squared error, 
that is, yields the minimum non- 
independent x?, is of course the best 
fitting model. Appropriate parameter 
estimates can be obtained for any 
model by finding those parameter 
values which yield the minimum 
weighted squared error for that model. 

As an alternative to the assumption 
that Ss are homogeneous with respect 
to the parameter values, it can be 
assumed that there are individual 
differences among Ss with respect to 
the values of the parameters. The 
data for individual Ss would have to 
be summed over trials and individual 
parameter values estimated for each 
S. The x? would then become 


T 16 M-—3 


x 2 P Lx Pr (0.5.0 


mo 


M-a 
— X FO? 


KEE PrO dJ [12] 


where fis an S index, / = le cep Rs 


For experiments with a large num- 
ber of Ss each one of which is run for 
many trials, as was the case in the 
present study, the analysis required 
to obtain a least x? fit for each S is 
very time-consuming and extremely 
expensive, even with a very high- 
speed computer. Thus, from the 
standpoint of both research cost and 
theoretical parsimony, it would be 
desirable if homogeneity could be 
assumed, allowing predictions to be 
made with a minimum number of 
estimated parameters. It follows, 
therefore, that the model should 
initially be tested under the assump- 
tion of homogeneity ; that is, the test 
in Equation 11 should be conducted 
first. Only if the weighted squared 
error is large should the test in Equa- 
tion 12, with its inherent assumption 
of heterogeneity of parameter values, 
be performed. 

The x? test requires a reasonably 
large expected number of observa- 
tions in each of the response sequence 
categories. It is often extremely 
expensive to run enough Ss to insure 
a sufficient frequency. of all possible 
response sequences beginning on a 
particular Trial n. In order to cir- 
cumvent this problem, it may be 
necessary to sum over all trials or 
blocks of trials in the experiment. 
Under the homogeneity assumption 


and summing over all trials, the x? 
becomes 
16 M—3 
= LLL E P) 
"m nel 
M-—3 
— E Fen)? 
nel 
M-3 
XIT E Pr(QQ,,237-. [13] 


n=l 
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Sequential Analysis for the Response- 
Independent Models 


According to the response-inde- 


pendent models, 


Pila = il Xs =j 
Xn- = k, Xnr = ],...) 
= Pr(X, = i), [14] 


for 1, j, Rk, m, ... —0, 1. In the 
introduction, pa was used to represent 
Pr(X,- 0). Thus, Pr(X, = 1) is 
equal to 1 — 2, or q,. It therefore 
follows that the probability of a 
particular response sequence 0, 
= UG. =4, Xa =j, Denm =k, 
Xans = m} will be equal to 


Pr(0e,n) = (an) (2) (Gna) 4 (Pugs)? 
* 42) (052)! (053) (pn) ™, 
[15] 
for i, j, k, m — 0, 1. Having theo- 
retical expressions for the Osn, it 
is then possible to obtain a measure 
of the weighted squared error of the 
response-independent model through 


the x? procedures given in Equations 
11, 12, or 13. 


ANALYSIS OF THE DATA 


The 4-tuple sequences of responses 
will be analyzed first with respect to 
the Markov model. We proceed by 
specifically assuming that the rabbits 
are homogeneous with respect to 
parameter values. As mentioned 
earlier, Py takes on an a priori 
value of zero and will not be con- 
sidered as a free parameter. Every 
rabbit reached a criterion of 10 con- 
secutive CRs within 150 trials. _To 
estimate Pec, the mean proportions 
of CRs on trials subsequent to the 
criterion run of 10 consecutive CRs 
were computed and appear in Table 
1. As can be seen, the proportions 
are stationary at a value close to .95. 
'Thus, the estimate of Pc will be taken 
as .95. Estimates of the parameters 
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TABLE 1 


PROPORTION or CRs ON TRIALS SUBSEQUENT 
TO A CRITERION Run or 10 
Consecutive CRs 


Blocks of 5 trials Proportion of CRs 


NAM U Ne 
O 
E 


a, P4, and c were obtained by having 
a Control Data Corporation 1604 
computer select values which mini- 
mized the x? given in Equation 13. 
The parameter values which mini- 
mized the x? were a — .030, P4 
— .375, and c — .037. The observed 
and expected frequencies of the 4- 
tuple response sequences, summed 
over all trials of the experiment, are 
given in Table 2. An examination of 
Table 2 indicates that the Markov 
model is following the profile of the 
data very well and yields a minimum 
x? of 19.66. 

In spite of the fact that the model 
is accurately predicting the frequen- 
cies of the response sequences, it 
might be argued that since we have 
summed over trials, the model may 
not be faithfully predicting the trial- 
by-trial changes in the probability of 
a CR. To assess the trial-by-trial 
changes in response probability, we 
require that the Markov mode] pre- 
dict the mean learning curve, Equa- 
tion 6, using the parameter values 
estimated on the basis of the 4-tuple 
Sequences of responses. The ob- 
served and predicted proportions of 
CRs in blocks of 10 trials are given 
in Figure 1. It is difficult to imagine 
how the predictions could be im- 
proved. The Markov model is doing 
an excellent job predicting the trial- 
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OM THE MARKOV AND RE- 
ENDENT MODELS AND 


MODEL WITH INDIVIDUAL 
DIFFERENCES 


Sequence | Data Markov | Independent | Linear 
0000 | 7361 |7348 4779 5663 
0001 473 | 461 973 872 © 
0010 450 | 468 997 892 
0011 176 | 149 401 367 
0100 469 | 488 1024 914 
0101 127 150 415 379 
0110 161 162 431 392 
0111 231 | 215 428 395 
1000 553 | 544 1051 941 
1001 160 | 161 433 395 
1010 151 | 173 448 409 
1011 217 | 229 459 423 
1100 249 | 222 467 427 
1101 241 | 253 490 451 
1110 326 | 311 523 481 
1111 | 3502 | 3513 1528 1446 

d 19.66| 6422.41 | 5309.49 


by-trial changes in mean response 
probability. 

Consider now the entire class of 
response-independent learning models 
which have the property that the 
occurrence of a response on some Trial 
n is a function only of the probability 
of the response on that trial and is 
independent of the sequence of re- 


E 
ad 
£ 
$c 
m 
m 
zs 
$ 
E 
EE 
Es gy 
t. m 
o o OBTAINED 
] 
á pi m 
pe E r r A 
ac ar eet G PE NE ee 


BLOCKS OF IO TRIALS 


Fic. 1. Obtained group mean learning curve 
and prediction from the Markov model. 


sponses on all preceding trials. An 
adequate response-independent model 
would have to predict values of 
[po Po $n. bu} which would 
accurately describe the obtained mean 
learning curve given in Figure 1. 
Under the hypothesis that the sub- 
jects are homogeneous, the maximum 
likelihood estimate of x for the most 
general response independent model 
is just the proportion of CRs on Trial 
n. The maximum likelihood esti- 
mates of pn from the rabbit data were 
entered into Equation 15 to predict 
the sequences 0,,. Using Equation 
13, a x? was then computed between 
the predicted and obtained frequencies 
of 4-tuples of response sequences, 
which are given in Table 2. As can 
be seen, the response-independent pre- 
dictions are not following the profile 
of the data. The data sequences are 
predominantly all CRs or all failures 
to respond (non-CRs). The response- 
independent model indiscriminately 
predicts too many sequences CON- 
taining both CRs and non-CRs. The 
x the measure of the weighted 
squared error, between obtained and 
predicted frequencies is extremely 
large, 6,422.41. It is obvious that 
the response-independent model is 
giving an extremely poor approxima- 
tion to the sequential features of the 
data. 


Individual Differences 


Itis obvious that under the assump- 
Ss, the entire 


tion of homogeneity of 
class of response-independent models 
cannot be considered a realistic ap- 
proximation to the eye-blink € 
tioning process. The next logica 
step is to assume that a response- 
independent model holds for condi- 
tioning in individual Ss, but that Ss 
differ with respect to the values of 


the parameters of the model. Un- 


fortunately, a reasonable estimate of 
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Pn for the general case of the response- 
independent model is not available 
from the data of a single S. The 
estimate of p, would be unity or zero 
depending upon whether the subject 
made a CR or not, respectively, on 
Trial 2. With these estimates, the 
responsc-independent model would be 
trivially true since it would predict 
the data exactly. It would, however, 
use up all the degrees of freedom in 
the data to do so. An alternative 
approach would be to assume some 
specific case of the response-inde- 
pendent model which would yield 
stable parameter estimates for indi- 
vidual subjects. This would leave a 
finite number of degrees of freedom 
and would permit meaningful predic- 
tions to be made. 

The most familiar and most widely 
applied case of the response-inde- 
pendent model is the single-operator 
linear model (Bush & Mosteller, 
1955; Estes, 1950), which requires 
that for an individual subject 7 


Pin = di — (Ai — pinar, [16] 


In Equation 16, p;,, is the theoretical 
probability of a CR by subject 7 on 
Trial n, ^; is theasymptotic probability 
of a CR by subject i, and a; is a rate 
of growth parameter which can take 
on only values between zero and 
unity. For experiments in which 
the observed proportion of CRs for an 
individual is asymptotically stable at 
some value ^; the maximum likeli- 
hood estimate of the parameter a; is 


gsi Kip NAS [ir] 


where N; is the total number of trials 
subject / was run in the experiment, 
and S; is the total number of CRs 
which he made in the N; trials. To 
obtain an estimate of ai it is first 
necessary to obtain appropriate esti- 
mates of the initial and asymptotic 
response probabilities. In general, 
the best way to obtain an estimate of 
Pia is to run a pretest in which the 


TABLE 3 
FREQUENCIES Or INDIVIDUAL PAR 
VALUES FOR THE SINGLE-OPER 
LINEAR MODEL 


Parameter value « (rate) À (asymptote) 
under .80 4 6 
.80- 1 0 
.82- 0 3 
.841- 0 10 
.86- 1 0 
.88- 8 10 
.90- 3 0 
.92- 8 25 
.94- 19 31 
-96- 50 0 
.98- 6 16 


CS is given alone for a number of 
trials. In the experiment under 
discussion, none of the 101 rabbits 
blinked to the CS on 10 pretest 
trials with CS alone, nor did any S 
blink to the CS on the first trial of 
conditioning. Thus, the maximum 
likelihood estimate of Pii for all Ss 
iszero. Inspection of the group mean 
learning curve in Figure 1 indicates 
that response probability did not 
change during the last 30 trials of 
the experiment, Trials 121-150. From 
the standpoint of the linear model, 
a case can be made that the data 
are asymptotic after Trial 120. For 
each S an estimate of \; was obtained 
as the proportion of CRs on Trials 
121-150. Taking each S's A; estimate 
and assuming Pii = .0, a maximum 
likelihood estimate of a; was obtained 
for each S using Equation 17. Table 
3 gives the obtained frequency dis- 
tribution of the estimates of a; and 
^ for the single-operator linear model. 

Table 2 presents the 4-tuple se- 
quences of responses predicted by the 
linear model with individual rate and 
asymptote parameters. A compari- 
son can be made with the actual data 
and with the predictions from the 
Markov and general response-inde- 
pendent models which assume homo- 
geneity of the parameters. The 
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measure of the weighted squared 
error for the linear model with indi- 
vidual differences is only slightly 
better than that of the general 
response-independent model under the 
homogeneity assumption, x* — 5309.49. 
Thus, even permitting individual dif- 
ferences in rate and asymptote of 
learning, the linear model is unable 
to give a satisfactory account of the 
third-order sequential dependencies 
in the data. By comparison, the fit 
of the Markov model is amazing. A 
total of 203 parameters were esti- 
mated from the entire data matrix 
of 15,150 responses in order to get 
the predictions for the linear model 
with individual differences and 150 
parameters were estimated to get the 
predictions for the general response- 
independent model, but only 5 param- 
eters estimated to get the predictions 
for the Markov model. 


COMPARISON OF THE MODELS 


Given estimates of pj,» it is possible 
to obtain a likelihood function for the 
entire Ss X Trials data matrix of an 
experiment. For any response-inde- 
pendent model the likelihood of any 
specific data sequence N; trials long 


N-2 N-1 


wero E 


j=l k=j+l 
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for a particular subject 7 is 


Ni 
le= II(1 25035952, ***, [18] 

n=l 
where X;., equals 0 if subject / makes 
a CR on Trial z and equals 1 otherwise. 
The joint likelihood L over a total 
of I Ss is just 

I 


i=l 


To obtain a likelihood function for 
the Markov model, we need to con- 
sider the trial on which S made a 
transition from the naïve state to the 
activated state and the trial on which 
S made a transition from the acti- 
vated state to the conditioned state. 
Designate these two trials as j and k 
where j is less than k. The trials 
on which the transitions occurred 
cannot be observed directly from 
the data, but they havesome likelihood 
of occurring on any trial from 1 to N. 
For the Markov model it can be 
shown that the likelihood of the 
particular response sequence N trials 
long given by subject 7 under the hy- 
pothesis of homogeneity of the param- 
eter values across Ss is 1; where 


fat — aE II ( Pree] 


n=l 


Kell — EAE IL 0- Pater red 


XE TL a — 295027771] 


N-1 
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[20] 


n=k+1 


p) (P) io] 
cis (Pa) =] 


ieee], 


n=l 
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As before, the joint likelihood of the 
entire Ss X Trials data matrix for the 
Markov model is just 


L'-]IgEpN. [21] 


del 


At this point it should be stressed 
that a likelihood value is a number 
between zero and unity which speci- 
fies how well a particular model is 
predicting the data from a particular 
experiment in all its aspects. A 
likelihood value is sensitive to all 
the sequential dependencies in the 
data. By contrast, the x? approach 
used earlier is sensitive only to third- 
order sequential dependencies. Thus, 
the likelihood approach is much more 
powerful than the x? approach, and 
inadequate models can be detected 
more easily with a likelihood ap- 
proach. When considering a set of 
alternative models with respect to a 
specific set of data, the model which 
yields the largest likelihood value is 
the model which is predicting the 
data best and which has the highest 
probability of being isomorphic with 
the process which actually generated 
the data. Thus, all other considera- 
tions aside, the model which yields 
the largest likelihood value is the 
best fitting model. 


Likelihood Ratio Evaluation 


The ratio of two likelihoods from 
two different models also has very 
interesting properties. Let R repre- 
sent the ratio of two likelihood values, 
with the smaller likelihood being 
placed in the denominator. It is a 
well-known statistical result that if 
one of the models is a submodel of the 
other, 2 log.& is distributed as x? 
where the degrees of freedom are 
equal to the difference in the number 
of parameters estimated for the 
numerator (general) model minus the 


number of parameters estimated for 
the denominator (restricted) model. 

Further, even if the likelihoods are 
from models of unrelated classes, 
the ratio R can be expressed as odds 
in favor of one model over the other. 
Specifically, the hypothesis that the 
data were generated by a process 
isomorphic with the model yielding 
the higher likelihood is R times more 
likely of being true than the hypothe- 
sis that the data were generated by a 
process isomorphic with the model 
yielding the lower likelihood. Thus, 
the likelihood ratio can be viewed as a 
measure of the relative efficiency of 
prediction of one model over another. 

Consider, now, the data of the 
rabbit experiment discussed earlier. 
For the general response-independent 
model with 150 estimated param- 
eters, the maximum likelihood L of 
the entire data matrix of 15,150 trial 
outcomes is 10-?!, For the single- 
operator linear model with individual 
differences the corresponding maxi- 
mum likelihood is 10-9? with 203 
estimated parameters. For the Markov 
model the likelihood of the entire data 
matrix given values of the 5 param- 
eters estimated by the minimum x? 
technique is 10-193, Comparing the 
response-independent model with no 
individual differences with the linear 
model with individual differences, 
the likelihood ratio R is 1052, Thus, 
the linear model with individual dif- 
ferences is predicting the data much 
closer than the general response- 
independent model without individual 
differences. Comparing the Markov 
model with no individual differences 
with the single-operator linear model 
with individual differences, the likeli- 
hood ratio R is 10, in favor of the 
Markov model. Thus the hypotheses 
that the Markov model is isomorphic 
with the conditioning process is 10 
times more probable than the hypoth- 
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esis that the linear model and the 
conditioning process are isomorphic. 
This is true in spite of the fact that 
the Markov model has only 5 esti- 
mated parameters while the linear 
model has 203. These likelihood 
ratio comparisons give a precise 
statistical summary of what is obvious 
from inspection of Table 2. The 
Markov model is the best fitting of 
the three models considered in this 


paper. 


‘THEORETICAL IMPLICATIONS 


The analyses indicate that there 
are definite sequential dependencies in 
eye-blink responses of individual rab- 
bits. Early in conditioning, non- 
CRs tend strongly to follow non- 
CRs. Later, CRs tend strongly to 
follow CRs. The observed depend- 
ence is consistent with the three-state 
Markov model under the hypothesis 
that the rabbits are homogeneous 
with respect to rate of learning and 
asymptotic level of performance. The 
observed dependence in the data is, 
however, too great to be accounted 
for by a response-independent model 
such as the single-operator linear 
model of Bush and Mosteller (1955) 
and Estes (1950), the single-operator 
beta model of Luce (1959), and the 
response-strength model of Spence 
(1956). It is possible that the latter 
two models could account for the 
data if the assumptions of homo- 
geneity of the learning rate and 
asymptote were relaxed. It remains 
to be seen, however, whether it is 
possible with individual parameter 
estimates to obtain a likelihood value 
for either of these two models which 
would be larger than the value 
yielded by the three-state Markov 
model. Even if it proves possible, 
then the question of parsimony must 
be considered in comparing the models, 
since the Markov model has only 5 
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estimated parameters while the models 
with individual differences will have 
many more. 

In addition to predictive ability 
and parsimony, the Markov model has 
the desirable property of generality. 
It is a member of a family of models 
with three discrete levels of per- 
formance which have been shown to 
describe accurately learning in many 
situations with several species of 
organisms. In humans, for example, 
Markov models of this type suc- 
cessfully described paired-associates 
learning (Atkinson & Crothers, 1964), 
recognition and recall learning (Evans 
& Dallenbach, 1965; Kintsch & 
Morris, 1965; Waugh & Smith, 1962), 
and some aspects of eyelid condition- 
ing (Bower & Theios, 1964). Re- 
lated Markov models also accurately 
describe shock avoidance conditioning 
in rats and dogs (Theios, 1963; 
Theios & Brelsford, 1966) and shock- 
motivated choice behavior in rats 
(Bower & Theios, 1964; Theios, 
1965). Thus, on the criteria of 
precision of prediction, generality, 
and parsimony, there is reason to 
continue using and investigating the 
three-state Markov model as a tool 
in classical conditioning research. 

There are a number of alternative 
learning models which permit both 
gradual increases in response prob- 
ability over trials for individual 
subjects and large response-depend- 
ence effects. These include the two- 
operator linear model of Bush and 
Mosteller (1955), the two-operator 
beta model of Luce (1959), and the 


- del of Norman (1964). 
ie hat these models could 


It is possible t 1 
account for the response dependencies 
Because of the 


in the rabbit data. 
nature of these models, however, the 


algebraic task of deriving expressions 
for the predicted response sequences 
is quite formidable. 
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There is no reason why the Markov 
model considered in this paper should 
be restricted to eye-blink conditioning 
studies in rabbits. It is directly 
applicable to eyelid-conditioning stud- 
ies involving human Ss. The only 
change would be that the parameter 
Py would take on a nonzero value 
and would have to be estimated. 
The model which Bower and Theios 
(1964) applied to Gormezano’s human 
eyelid-conditioning data differs from 
the present model only in that in the 
present model the initial probability 
of a CR may be greater than zero 
and the asymptotic probability of a 
CR is less than unity. As a result, 
the model does not require that the 
proportion of CRs be stationary on 
the intermediate trials between an 
S's first CR and his last non-CR, as 
does the earlier model of Bower and 
Theios (1964). In fact, the present 
model predicts that the probability 
of a CR should increase on these 
intermediate trials. This follows from 
the fact that during the intermediate 
trials S may move from the naive 
state to the activated state or from 
the activated state to the conditioned 
state. Within the naive, activated, 
and conditioned states the probability 
of a response to the CS is assumed 
to be stationary at values of Py, P4, 
and Pe, respectively. However, if a 
transition from one state to the next 
state takes place during the inter- 
mediate trials, there will result a 
consequent increase in the probability 
of a CR, since Pc is larger than P4, 
which is larger than Py. Thus the 
model permits increasing CR prob- 
abilities on the intermediate trials. 
The model does predict stationarity 
of the intermediate responses only 
when the initial probability of a CR 
is absolutely zero and the asymptotic 
probability of a CR is absolutely unity. 
Prokasy (1965) has published data 
from eight eyelid-conditioning stud- 


ies with human Ss in which the 
response probability on the inter- 
mediate trials is not stationary. In 
each study, the initial CR probability 
was greater than zero and the asymp- 
totic CR probability was less than 
unity. 

Since the present Markov model 
permits S-shaped mean learning curves 
with initial levels of responding greater 
than zero, asymptotes less than 100% 
responding, and nonstationarity of 
response probability on trials between 
the first CR and last non-CR, we 
know of no published human evelid- 
conditioning data which is incon- 
sistent with it. Because of the 
extremely high variability which is 
often found in the data of human 
eyelid-conditioning studies, it will be 
necessary to take into consideration 
individual differences among Ss. 
These individual differences may take 
on several forms. For example, Ss 
may differ with respect to either 
learning rate or asymptotic perform- 
ance level, but be homogeneous with 
respect to initial level of performance, 
Alternatively, Ss may be homogeneous 
with respect to learning rate and 
asymptote, but may differ with re- 
spect to their initial level of per- 
formance. With human Ss it may not 
be reasonable to assume that all Ss 
start the experiment in the naive 
state as was assumed with rabbits. 
Even if Ss were homogencous with 
respect to the values of the param- 
eters, it is possible that a proportion 
a of the humans start out in the 
activated state and a proportion f 
start out in the conditioned statc. 
Thus, only a proportion 1 — a — 8 
would start out naive. If this state 
of affairs were true, an examination 
of the data might lead one to the 
erroneous conclusion that Ss were 
heterogeneous with respect to lcarn- 
ing rate, since some Ss would give 
CRs early in training and others not 
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until quite late in training. It would 
be expected that a and 8 would be 
related to such variables as the type 
of instructions or "set" given to S, 
S's past history, and whether or not 
a masking task is used. 
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APPENDIX 
CONDITIONAL PROBABILITIES of SEQUENCES 
As defined in the text, Pr(0,|i) represents the conditional probability of a specific 


response sequence 0,,, given that the conditioning process is in State i of the Markov 
chain on Trial z, where i = N, A, C. Let Q; = (1 — P;) and 


B = (1 — a)Pyx 
F= (1 — a)Ox 
Z= i= P. 
Y= (1-904. 


Then: 

Pr(0\|N) = PsLB* + B'aP4 + BaPa(Z + cPc) + aPA(Z? + ZcPe + cPc?)] 
Pr(Qi|A) = Pa(Z? + Z*cPc + ZcPe? + cP) 

Pr(0:|C) = Pet 

Pr(0s| N) = Py(B’F + B'aQa + BaPaA(Y + cQc) + aPA(ZY + ZcQe + cPcQo)] 
Pr(0:| 4) = Pa(Z?7¥ + Z*eQc + ZcPeQc + cPc*Qc) 

Pr(02| C) = PoiQe 

Pr(0s| N) = Py(B*F + BFaPs + BaQa(Z + cPc) + aPA(ZY + YcPc + cPcQc)] 
Pr(0s|A) = PA(G2Y + ZYcPe + ZcPeQc + cQcPc?) 

Pr(0;|C) = Pc'Qe 

Pr(04| N) = Py[BF* + BFaQa + BaQa(Y + cQc) + aPa(¥? + YaQc + cQc?)] 
Pr(0,|4) = Pa(Z¥? + ZYcQc + ZcQc? + cQe?Po) 

Pr(0,|C) = PeQc 

Pr(0s| N) = PuLB*F + FBaP, + FaPa(Z + cPc) + aQa(Z? + ZcPc + cPc?)] 
Pr(0s|A) = PA(YZ? + YZcPc + ZcPc? + cPc*Qc) 

Pr(05|C) = Pc?Qe 

Pr(0s| N) = PuLBF* + BFaQa + FaPA(Y + cQc) + aQA(ZY + ZcQc + cPcQc)] 
Pr(0,|A) = PACY?Z + YZcQo + YePcQe + cQc*Pc) 

Pr(0,|C) = Po? Qc* 

Pr(0:| N) = PuEFB + FaP4 + FaQa(Z + cPc) + aQa(Z¥ + VePo + cPcQc)] 
Pr(0;|A) = PA(Y?*Z + ZYcPc + YcPcQc + cPcQc*) 

Pr(07|C) = Pee 

Pr(Os|N) = PĒC? + F'aQa + FaQA(Y + cQc) + aQa(¥? + ¥eQe + cQc?)] 
Pr(0s|A) = Pa(¥? + Y'cQc + YcQe? + cQc?) 

Pr(04|C) = PcQc? 


For s = 9, 10, 11, . . » 16, a general equation for Pr(0,|7) is as follows: 


Pr(0,|2) = RiPr(0,—8|2), 
where R; = Q;/P; and i= N,4A,C. 
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TABLE 4 
CODED SEQUENTIAL DATA FOR Inpivipuat Rannirs (S) 
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Table 4—Continued 


Length of successive runs of non-CRs (odd positions) and CRs (even positions) 
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ATTACHMENT BEHAVIOR OF MAMMALS' 


ROBERT B. 
Indiana Un 


Available data support the empirical 


CAIRNS 


iversity 


generalization that animals tend 


to remain in the presence of objects to which they have been continu- 


ously exposed. From such observati 


ions, and certain assumptions of 


associative learning theory, it is proposed that mammalian attachment 


behavior is determined by: 


(a) the length of association with an object 


in a given context, and (b) the relative cue weight of the object. The 
model emphasizes the associative nature of the attachment process, the 
contextual and response relativity of attachment behavior, and the role 
of the immediate stimulus setting in the control of an animal's response 


» subsystems. 


This paper is concerned with the 
mechanisms by which the social at- 
tachments of mammals are acquired, 
maintained, and extinguished. Follow- 
ing a brief overview of the problem, 
a preliminary formulation on the na- 
ture of the attachment process will be 
outlined, and the relevant evidence re- 
viewed. To adumbrate the general 
proposal, no essential distinction is 
drawn between social and nonsocial 
attachments, and both classes of be- 
havior are seen as inevitable outcomes 
of an associative conditioning process. 


AN OVERVIEW 


Animals tend to approach and re- 
main with others of their kind. This 
response disposition is distinctive in 
its persistence and intensity, compara- 
ble in such respects to behavior pat- 
terns that are immediately relevant to 
physiological need states of the animal. 
But despite the prominence of this 
response tendency, and its obvious rele- 
vance to theories of social behavior, 
there have been until recently few sys- 


1 Preparation of this manuscript was facili- 
tated by à fellowship period at The Jackson 
Laboratory (PHS Special Fellowship 1-F3- 
MH-30, 205-10). I am indebted to several 
of my colleagues at Indiana University for 
critically reading earlier versions of this 


paper. 
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tematic attempts to explore its deter- 
minants in mammals (Beach & Jaynes, 
1954; Scott, 1962). 

Consequently, explanations of social 
attachment behavior in psychology tra- 
ditionally have been linked more di- 
rectly to prevailing general theoretical 
systems than to the results of specific 
empirical investigations. Early ac- 
counts which emphasized the operation 
of innate neurophysiological mecha- 
nisms, such as the “gregarious instinct” 
(McDougall, 1908) and “herd in- 
stinct” (Trotter, 1916), thus gave way 
to secondary drive concepts, for exam- 

le, “drive for gregariousness" (Dol- 

lard & Miller, 1950) and dependency 
motivation (Whiting & Child, 1953). 
Subsequent studies of the ontogeny o 
social attachment behavior have, how- 
ever, yielded meager support for key 
assumptions of the learned drive model 
(e.g, Harlow & Zimmermann, 1959; 
Schaffer & Emerson, ; Sears, 
Maccoby, & Levin, 1957). 


Various alternative hypotheses re- 


cently have been offered to account for 
ng plays in the for- 


the role that learning 
mation of mammalian attachments. 


In general, social bonds have been pre- 
when an animal, (S) 


sumed to form nin 
is maintained in close proximity to an 
crucial condi- 


object, (Oa) and some 
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tion, or set of conditions, is fulfilled. 
The latter has referred to characteris- 
tics of Oa (eg., Harlow & Zimmer- 
mann, 1959), to particular states of S 
(e.g., Collias, 1962; Scott, 1962), and 
whether O, has served as a general 
source of reinforcement events for S 
(e.g., Gewirtz, 1961). . ; 

But the history of research in this 

area, brief though it be, indicates that 
social attachments can occur in the ab- 
sence of assorted conditions that have 
been assumed to be primary. Attach- 
ments apparently develop in circum- 
stances which do not include: lactation 
by Oa (Harlow, 1958; Igel & Calvin, 
1960); immaturity of S (Altmann, 
1963; Jensen, 1965) ; physical contact 
between S and O, (Cairns & Johnson, 
1965) ; or nonpunitiveness of O, with 
respect to $ (Fisher, 1955; Rosen- 
blum & Harlow, 1963 ; Seay, Alex- 
ander, & Harlow, 1964). Accord- 
ing to one reviewer (Scott, 1963), 
available evidence Supports the conclu- 
Sion that various conditions facilitate, 
but are not necessary for, the forma- 
tion of social attachments, These re- 
Sults raise the compelling possibility 
that no special event, environmental or 
organismic, is an essential accompani- 
ment of proximity, Or, stated posi- 
tively, the evidence Suggests that an 
animal will form an attachment with 
Tespect to any object, animate or in- 
animate, with which it has been main- 
tained in a proximate relationship 
(Scott, 1963). 

Finally, some primary features of 
the reference phenomenon should be 
noted. In the first place, attachments 
occur discriminatively, for example, 
animals characteristically become more 
strongly attached to animate than to 
inanimate objects. ^ Moreover, the 
Closest social ties typically are formed 
With respect to one or more conspecific 
animals. Second, a number of condi- 
tions apparently facilitate the attach- 
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ment formation process. As noted 
above, recent studies have implicated 
both organismic and object/situational 
variables. Third, social bonds are not 
only formed and maintained, they or- 
dinarily diminish and are extinguished. 
Though few experimental studies are 
available with respect to the last point, 
naturalistic and observational evidence 
indicates that filial attachments invari- 
ably diminish in strength or shift to 
alternative objects. Any model which 
attempts to account for the attachment 
formation process in mammals should, 
as a minimum, specify the mechanisms 
which control the selectivity, rapidity 


of development, and extinguishability 
of these responses, 


Towanp A THEORY OF MAMMALIAN 
ATTACHMENT BEHAVIOR 


Previous results support the empiri- 
cal generalization that an animal forms 


Tespect to those objects with which it 
has been continuously maintained, Can 
a case be made for the stronger propo- 


de- 


hypothesis has Parsimony in its favor, 
it encounters igni 


retical import: 
qua contiguity 
and the direction of social attachment 
behavior ? D 
Such issues require that explicit at- 
tention be given to the nature of the 
learning process involved in social at- 


— 
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tachment formation. Following a brief 
overview of certain assumptions of con- 
tiguity learning theory as it has been 
developed by E. R. Guthrie (1952, 
1959) and W. K. Estes (1950, 1958, 
1959), some implications of these as- 
sumptions for social attachment be- 
havior will be examined. 

1. It has been assumed that a stimu- 
lus pattern which accompanies a re- 
sponse, and/or immediately precedes 
it, becomes a cue for that response 
(Guthrie, 1952; Voeks, 1950). Re- 
sponses, then, become associated to 
configurations of stimuli considered as 
units. Should the response be ex- 
tended over time, it would be condi- 
tioned to the several stimulus patterns 
sampled during that period. 

2. A second assumption concerns 
the weight, or importance, that any 
object or environmental event will be 

particular stimulus pat- 
In general, it has been assumed 
that the weight of a subset of stimulus 
elements is a joint function of the sam- 

F obability and the number of 
pling PrO hi Estes, 1958) 
elements within the set ( m , i $ 
Thus, the larger the set, OF E: S Bee d 
the sampling probability o Ree E 
ments, the more important E xm ES 
its role as @ determinant of oe ~— 3 

While theorems relating Size i ia 
subset to observable ae e p 
yet to be formally. develope des das 
been assumed that such stimu us as- 
pects as intensity, vividness, and “at- 
tention-getting” “characteristics covary 


with the number of elements in the 
subset (Atkinson & Estes, 1963, p- 
202). Whether an event 1s salient at 
any given moment would presumably 
also be a function of its physical in- 
volvement in S’s ongoing response (cf, 
the «discriminanda" and : manipu- 
landa" properties of stimuli, Tolman, 
1932). Holding sampling probability 
constant, then, highly conspicuous Or 
 response-involved events would be as- 


assigned in a 
tern. 
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signed a greater weight than incon- 
spicuous or noninvolved events in the 
stimulus patterns available to S. 

3. A third assumption has to do with 
the development and disruption of re- 
sponse sequences. The integration of 
ongoing behavior has been accounted 
for by assuming the formation of re- 
sponse chains, such that the perform- 
ance of one phase of an activity pro- 
vides the stimulus components for the 
next succeeding phase. Stimulus pat- 
terns supporting ongoing behavior thus 
would be made up of internal (eg. 
“movement produced," Guthrie, 1952) 
and external events. 

It seems reasonable to expect, fur- 
thermore, that a continuously present, 
heavily weighted cue can become a sig- 
nificant component in a large propor- 
tion of the stimulus patterns available 
to S." While such events are usually 
internal, under some conditions (e.g. 
continuous and exclusive exposure), 
prominent external cues would become 
conditioned to a considerable propor- 
tion of the subsystems in the animal's 
response repertoire. Their presence in 
stimulus patterns would thus provide 
for the maintenance of orderly be- 


RELATIVE WEIGHT OF CUE 
= 
EI 


5j bed abedeti. 


—————— 
TIME 


Fic. 1. Schematic representation of the 
proposed relationship between length of asso- 
ciation and cue weight in the conditioning of 


response subsystems. 
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havior ; their abrupt removal from the 
context would be the occasion for the 
disruption of orderly behavioral se- 
quences (cf., “behavior supports,” Tol- 
man, 1932, p. 85). : 

To summarize the foregoing, the 
probability that an external event will 
acquire effective control over a given 
response system of the organism will 
be determined by the joint operation 
of two variables: the weight of the 
stimulus subset, and the extent to which 
it has been present during the per- 
formance of several variations of the 
response system. This relationship is 
shown graphically in Figure 1. The 
ordinate refers to the weight of the 
stimulus subset relative to other ele- 
ments in the stimulus pattern; and the 
abscissa refers to the proportion of 
stimulus patterns in which the subset 
is represented. In terms of observable 
variables, subset weight would vary 
with stimulus salience (ie, intensity, 
movement, etc.) and response-involve- 
ment; while proportion of stimulus pat- 
terns (corresponding to response sub- 
Systems a, b, ...i,.) would be a 
function of the length of time that the 
event was present. The higher the 
ratio of shaded/total area within any 
given subrectangle, the greater should 
be the likelihood that a particular re- 
sponse system would be disrupted by 
the event's removal. 

The integration of the shaded areas 
across subrectangles should provide an 
index of the generalized control that 
any object has over S’s behavior. For 
instance, if a heavily weighted event is 
represented in all of the stimulus pat- 
terns which have been conditioned to 
S's basic response systems, it would 
attain generalized control over S's re- 
sponses. This state of affairs is repre- 
sented schematically in the upper right- 
hand corner of Figure 1. The event’s 
Temoval should be associated with at 
least a temporary disruption of all of 
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the response systems for which it was 
relevant. 

Following Guthrie (1952), it can 
further be argued that the reintroduc- 
tion of a widely conditioned cue (as 
indicated in upper right of Figure 1) 
constitutes a “reinforcing” state of af- 
fairs. The reasoning can be sum- 
marized as follows: (a) If the removal 
of a heavily weighted, generalized cue, 
C, disrupts all established behavior pat- 
terns, then (b) a series of disjointed 
responses, R,, Ra, Rs, Res. « Ry Re 
will occur in the Presence of the re- 
maining stimulus patterns, S.° and 
(c) each new response in the series, 
for example, R,, will alienate its prede- 
cessors R,, Ry, R, from their associa- 
tion with S.C, But let some response 
in the series, Rs, remove the animal 
from S,°, Then, on subsequent occur- 
rences of S,9, R, would ba evoked. 
) r » (d) any response, 
Rs, which reintroduces the widely con- 
apneo r ivalently, removes 
5.9, will remain conditioned to .S,6. 


1 ass of stimulus 
patterns produced jf an extensively con- 


ditioned event were removed abruptly 
from the tota] Setting. The event's 
removal would, in effect, make unavail- 
able the population of Stimulus pat- 
terns to which the animal's basic re- 
sponse Systems had been conditioned. 
The Temaining stimulus patterns— 
those in which the event was not rep- 
Tesented—thus would be unable to 
support the Previously established re- 
Sponse systems, Any response that re- 
moves S.° would become conditioned 
to S.C (cf. Guthrie, 1952, p. 297). 
In effect, Stimulus patterns created by 
an extensively con- 
di be expected, in- 
Itially, to evoke Tesponses which lead 


ATTACHMENT BEHAVIOR OF MAMMALS 


in the special instance where an event 
had hecome an essential stimulus com- 
ponent in a high proportion of S's re- 
sponse systems. But what would be 
the case if an event of low relative cue 
weight were removed? According to 
the general formulation, the less heav- 
ily weighted the cue, the less disrup- 
tive would be its removal. The ab- 
sence of this event presumably would 
not materially affect the individual 
stimulus configurations of which it was 
a part, and the corresponding response 
systems would reflect little, if any, gen- 
eralized disruption (Figure 1, lower 
left and right). Since the remaining 
stimulus patterns could support pre- 
viously organized response sequences, 
these stimulus patterns would not be 
expected to become selectively condi- 
tioned to response sequences which re- 
introduce the event. 

Still a third major possibility can 
he considered : namely, where an event 
is heavily weighted in the stimulus pat- 
terns associated with certain response 
systems, but has a low relative weight. 
or 1s not represented at all, in other 
stimulus patterns (Figure 1, upper 
left). Tts removal may or may not 
lead to behavioral disruption, depend- 
ent upon the stimulus patterns being 


sampled at the time the event was 
removed, 


Implications for Attachment Behavior 


The major thesis of this paper can 
he stated succinctly. Tt is proposed 
that an ohject, or class of objects, can 
acquire an essential cue function for 
the maintenance of the response pat- 
terns of an animal. Objects which 
appear in recurrent combination with 
other environmental or internal events 
can become significant components in 
the stimulus patterns which support 
S’s behavior, including such basic main- 
tenance response systems as drinking, 
reposing. eating, etc. When O, is 
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present, the associative chains remain 
intact and stable responding is ob- 
served; when O, is absent, the asso- 
ciative chains are disrupted, and non- 
organized responding occurs. At a 
gross descriptive level, § would appear 
to have formed an attachment with 
respect to Os. 

Social and monsocial attachment- 
objects. In accord with stimulus the- 
ory, no qualitative distinction is drawn 
between the various sources of stimu- 
lation. But events clearly differ in 
terms of such factors as attention-get- 
ting properties and potentiality for be- 
coming involved directly in the re- 
sponses of S. Specifically, animals are, 
among other things, tvpically more 
mobile, conspicuous, and capable of de- 
veloping patterns of reciprocal stimu- 
lation than nonliving objects. And it 
would follow that cues provided by 
animals are more likely to he observed, 
and conditioned, than those of inani- 
mate objects. On the other hand, it 
should be possible to enhance the likeli- 
hood of attachment formation (ie. 
generalized conditioning) with respect 
to an inanimate object by manipulating 
its stimulus characteristics. For in- 
stance, if an object is highly conspicu- 
ous relative to all other objects in S’s 
maintenance environment, it would be 
expected to acquire significant cue 
properties for S's behavior. Or the 
maintenance environment itself may ac- 
quire such properties. rs. 

Contextual and response relativity. 
While the major concern in the pres- 
ent analysis will he with the case where 
an event is conditioned to a wide range 
of responses over various contexts, it 
should be noted that attachment for- 
mation may be response- or context- 
specific. In it is entirely possible 


Indeed, i Uo 
for one object to be conditioned to one 
set of responses, and 


another object to 
be conditioned to an al 


ternative set. 
Whether multiple relationships would 
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be formed would depend, in part, ee 
the number of different contexts hae 
jects) in which S has been maintained, 
and the extent to which distinctive 
epis subsystems have been condi- 
tioned to the different contexts. — 

Attachment maintenance and extinc- 
tion. The model is obviously not a 
static one: The question of attachment 
maintenance is as relevant as 5n gues 
tion of attachment acquisition. ou 
S's basic response systems become con- 
ditioned to 5.9", Oas removal would 

roduce little if any disruption. Tt 
would follow that the longer the period 
of separation from Oa, the greater 
would be the likelihood that Os's pres- 
ence was not required for the mainte- 
nance of general stable responding. On 
the other hand, those habits that S had 
acquired specifically with respect to Oa 
presumably would remain relatively in- 
tact if S and Oa were abruptly sepa- 
rated and kept apart. (Note how this 
state of affairs would be schematized 
in the upper left of Figure 1.) I 

Though a detailed comparison of this 
proposal to other concepts of social 
drive” is beyond the scope of this paper, 
one comment seems in order. In con- 
trast with most biological drive sys- 
tems, where the number of drive cues 
is positively correlated with hours of 
deprivation, it can be argued that the 
effects of “social” deprivation are at 
full strength immediately upon separa- 
tion from Os. Since reconditioning is 
presumed to begin immediately upon 
separation, the extent to which S Oa 
disrupts organized responding should 
diminish rather than increase as a func- 
tion of time. 

“Punishment” and "rewards? 
Neither noxious nor positive stimula- 
tion is assumed to operate directly upon 
attachment “strength” (ie, extent to 
which O, is conditioned to S’s response 


subsystems), although both may oper- 
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ate upon it indirectly. In naturalistic 
settings, noxious stimulation by On 
with respect to S is associated with at 
least temporary physical separation. 
To the extent that the separation is 
extended, attachment strength would 
be expected to diminish. On the other 
hand, under some environmental cir- 
cumstances (e.g., infancy, enforced co- 
habitation), separation is not possible. 
To the extent that noxious stimulation 
from O, did not inhibit S’s orientation 
toward or the performance of responses 
with respect to Oa, “punishment” could 
enhance the salience of O, and the es- 
sential conditioning process. 

Whereas noxious stimulation might 
operate to either increase or decrease 
attachment strength, such events as the 
delivery of consumables by Oa would 
be expected to enhance it. Any condi- 
tion which makes O, more prominent, 
or requires S to interact with O,, 
should increase the proportion of stim- 
ulus patterns in which O, is repre- 
sented. If certain consummatory 
responses, for instance, invariably oc- 
curred in the presence of On, these 
responses would become conditioned 
to stimulus configurations in which O, 
was represented. In general, however, 
the model indicates that the number of 
interactions between S and Og, and not 
the quality of the interactions or their 
outcomes, is the immediate determinant 
of attachment strength, 

Level of analysis. Not the least im- 
portant of the model’s implications are 
those having to do with methodological 
matters. To evaluate adequately the 
deductions that follow from it, a de- 
tailed analysis of molar response pat- 
terns is required, similar to that advo- 
cated by Bindra (1961) and Logan 
(1960) for the study of response SyS- 
tems of the individual, and by Sears 
(1951) and Schneirla (1946) for the 
study of social behavior. "The problem 
for research becomes one of specifying 
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the factors in the immediate environ- 
ment that are controlling an observed 
response sequence. Because typically 
two or more animals are involved, the 
analysis must be concerned with chains 
of reciprocal or dyadic stimulation, 
where the behaviors of one animal be- 
come the cues for the responses of the 
other. Even the label “attachment be- 
havior” is seen to be misleading when 
the disparate responses that are sub- 
sumed by this construct are analyzed 
in terms of their component elements. 

Finally, it should be clear that the 
model is concerned with the analysis 
of behavioral phenomena (e.g., whether 
S approaches and remains in the pres- 
ence of another animal), and not with 
emotional states that have been some- 
times presumed to accompany or un- 
derlie the behavior (eg., affection, 
love, fear, anxiety). 

To summarize the association hy- 
pothesis rather drastically, it is the pro- 
posal that the development and main- 
tenance of intraspecific attachments can 
be explained as a by-product of a con- 
tinuing conditioning process. The for- 
mation of complex response chains is 
facilitated by the participation of rela- 
tively durable extraorganism events, 
and it is to these cues that response 
patterns become “attached.” The cues 
provided by animate objects, by virtue 
of their salience and involvement in S’s 
behavior, are particularly likely to be- 
come conditioned. 

Deductions regarding the conditions 
for the formation and maintenance 
of social attachments include the 
following : 

1. An attachment of some strength 
will be formed with respect to those 
objects with which s ‘ has been con- 
tinuously maintained in a proximate 
aan. us characteristics of the 
: 1l constitute a significant de- 
he conditioning process, 


415 


such that the greater its salience and 
its relevance for S’s activities, the more 
likely that S's response subsystems will 
become conditioned to the object’s 
presence. 

3. Removal of an object which has 
become conditioned to a high propor- 
tion of the response systems of S should 
be associated with a significant disrup- 
tion of S's ongoing behavior; replace- 
ment of the object should be associated 
with a cessation of disruption. 

4. If a generalized conditioned en- 
vironmental event, Oa, is removed, 
and some activity of S reintroduces it, 
then this activity is likely to be re- 
peated upon subsequent occasions that 
O, is removed. In general, the more 
complete the behavioral disruption of S 
upon removal of O,, the greater will 
be the likelihood that S will perform 
a response that reintroduces Oa. 

5. The strength of S’s attachment 
with respect to O, should be inversely 
related to the length of separation 
from Oa. 


Some Recent EXPERIMENTAL 
FINDINGS 


Within the past 10 years a sufficient 
number of studies of mammalian at- 
tachment behavior have been completed 
to permit a preliminary evaluation of 
the association model. Unfortunately 
for comparison purposes, there has 
been considerable variation among 11- 
vestigators with respect to the proce- 
dures followed and the response meas- 
ures employed. Diversity of method 
is not without compensations, however. 
Since the work has not been focalized 
single species, it is 
possible to assess the adequacy of the 
model in terms of the attachment be- 
havior of several orders of the class 


Mammalia? 


exclusively on a 


jes which are seen as 


2 tud: EN: 
Only ipse - levant to the association 


being immediately rel 
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Effects of Extended Exposure 


A primary implication of the asso- 
ciation hypothesis is that a prominent 
environmental event will acquire sig- 
nificant cue properties for an animal 
which has been maintained in a proxi- 
mate relationship to it. Data relevant 
to this expectation are provided by 
three types of studies: (a) interspe- 
cific cohabitation; (5) isolation; and 
(c) intraspecific cohabitation. 

Interspecific cohabitation. The pro- 
cedure typically followed in these stud- 
ies has been to rear a young animal 
with an alien species or distinctive 
inanimate object, and to assess at some 
later time the animal’s preferences rela- 
tive to that species or object. Occa- 
sionally the interspecific attachment 
formation of adult Ss has been studied 
(e.g. Hersher, Richmond, & Moore, 
1963b). Work has been conducted 
with rodents (e.g., Denenberg, Hud- 
gens, & Zarrow, 1964; Nagy, 1965), 
carnivores (e.g. Igel & Calvin, 1960; 
Kuo, 1930, 1960), ungulates (e.g., 
Cairns, 1964; Hersher, et al., 1963b), 
and subhuman primates (e.g., Harlow, 
1961; Harlow & Zimmermann, 1959: 
Mason & Green, 1962). Various 
measures have been employed to as. 
sess attachment strength, ranging from 
indices of time spent in contact with 
the cohabitant to assessments of choice 
behavior in a U maze where the sub- 
ject is permitted to select either his 
cohabitant or a conspecific animal. 
The results of these investigations in- 
dicate that the experimental animal 
tends to reinstate the “social” object 
to which he has been continuously 
exposed. Cohabitant preference has 
been demonstrated even when S must 
choose between an alien species and his 


model have been cited. The review thus js 
selective rather than comprehensive. Due to 
space restrictions, discussions of Particular 
methodological issues have been omitted, 
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own (e.g, Cairns & Johnson, 1965; 
Denenberg, et al, 1964).3 

Isolation studies, Within the frame- 
work of the present theoretical analy- 
sis, the isolation experiment is viewed 
in terms of the Cues that are present 
and available to be conditioned in the 
“isolation” context. The emphasis is 
not upon what the animal has heen 
"deprived" of, but what he has been 
exposed to. This orientation can be 
contrasted with the tendency to view 
isolation in terms of its “deprivation” 
aspects, that is, that which has been 


withheld from the animal (cf., Lorenz, 
1965). 


1959), for example, elimi- 
self-visual 
grooming stimulation, 
reservation, it would be ci 
the association model to expect that 
animals reared in isolation would be 
less likely to develop res 
that require 
specific object than 
Which have been m 
Own species, 


r animals 
aintained with their 


information about the str 


r ength of con- 
specific preferences, 


There 
or animals main- 
ared with non- 
a diminished tend- 
other members of 


male Sex-partners more 
females reared with non- 
For male S's, however, 
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their own species. As in the cohabita- 
tion studies, evidence for this generali- 
zation is available for various mamma- 
lian species, including rats (Anger- 
meier, 1960; Ashida, 1964), cats (Kuo, 
1960; Schneirla & Rosenblatt, 1961), 
sheep (Scott, 1945), and monkevs 
(Harlow, 1962; Mason, 1960; Sackett, 
Porter, & Holmes, 1965). These re- 
sults are generally consistent with the 
interpretation that the conspecific ob- 
Ject may constitute a disruptive cue, or 
at the least, not be required for the 
performance of the isolated animal’s 
basic response repertoire. 
_ Intraspecific cohabitation. Informa- 
tion on the determinants of conspecific 
attachment formation is available for 
subjects ranging from human and in- 
frahuman primates (e.g., Ainsworth, 
1963; DeVore, 1963) to maternal lab- 
oratory rats (e.g, Rosenblatt & Lehr- 
man, 1963). With a few exceptions 
(e.g.. Mason, 1961 ; Nowlis, 1941), the 
research has focused upon the mother- 
infant relationship. Intraspecific stud- 
les cannot he easily interpreted as being 
peculiarly relevant to the association 
assumption, since length of exposure 
is confounded with several other poten- 
tially effective variables, including ef- 
fects of caretaking, lactation, and pos- 
sible innate determinants of preference. 
This work has yielded two general 
findings, however, that seem imme- 
diately relevant to the present hy- 
pothesis. 
One finding concerns the typical in- 
sulation of mother-young units from 
competing associations (Beach & 
Jaynes, 1954). For slow-maturing 
mammals (analogous to altricial birds) 
insulation is assured by the incompe- 
tence of the neonate. Moreover, 
where the young are precocious, the 
environment is typically arranged to 
keep the mother and her offspring iso- 
lated. In the case of wild ungulates, 
for instance, the maternal animal physi- 
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cally separates her infant and herself 
from the herd (eg., Altmann, 1963; 
Darling, 1937). Among domesticated 
ungulates, further, a functional isola- 
tion of maternal-infant pairs is typi- 
cally maintained by the rejection of 
the young animal by potential surro- 
gates. Should the neonatal-maternal 
association be disrupted shortly after 
parturition, the mutual attachment 
process is retarded or inhibited (Col- 
lias, 1956; Hersher, Richmond, & 
Moore, 1958). 

Second, it appears that prolonged 
and extensive interaction. between a 
maternal animal and her offspring pro- 
motes a dyadic conditioning process, 
where the behaviors of the one acquire 
a significant cue function for the other. 
For instance, Rosenblatt & Lehrman 
(1963) have observed how the cues 
of the neonatal rat pup provide the 
stimuli for the maternal activities of 
licking and retrieving. Maternal lick- 
ing, in turn, apparently elicits and 
directs the neonate's elimination and 
sucking responses. This behavior of 
the infant then stimulates further con- 
summatory behavior and stimulation by 
the mother. Similar descriptions of 
the reciprocal dependency of maternal- 
infant behaviors have been given for 
cats (Schneirla, Rosenblatt, & To- 
bach, 1963), sheep (Collias, 1956), 
and, at a more global level, human in- 
fants (Schaffer & Emerson, 1964). 

While maternal-infant interactions 
are uniquely suited for the develop- 
ment of a wide range of interresponse 
contingencies, the primary phenome- 
non may be also observed in nonfilial 
relationships. Nowlis (1941) has ob- 
served that an important determinant 
of whether a juvenile chimpanzee pre- 
fers one partner over another is S’s 
ongoing behavior: one partner was 
preferred for mutual grooming; an- 
other for play-fighting, etc. (also 
Mason. 1960). These observations are 
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consistent with the proposal that ier 
preferences reflect the extent to ic! 
stimuli provided by an object oe 
become integrated into, and provide 
cues for, particular response sequences 
of S. 


Stimulus Properties of Cohabitant 


One of the principal assumptions of 
the present model is that the likelihood 
that a given response or set of re- 
sponses will become conditioned to an 
environmental event is a direct func- 
tion of the stimulus weight of that event 
during the performance of the response. 
As discussed above, the stimulus 
weight of the event is presumed to be 
determined by its perceptual salience 
and its properties as a manipulanda, 

Given this assumption, along with 
the fact that particular variables facili- 
tate attachment formation (e.g., 
tion, tactile stimulation), 
argued that these stimulus 
tics are effective because t| 
the salience or interactiv 
of the object. While the 
plausible, 
logical. Unless there is 


lacta- 
it might be 
characteris- 
hey enhance 
e capabilities 
claim may be 
ecome tauto- 


» there seems 
riables 
ior are 

€ event 
relatively prominent or become in. 
volved directly in S's response pat- 
terns. An independe: 


neepencent specification of 
stimulus weight is obviously called for. 
It has been generally assumed that 


the attention-getting properties of an 
event depend, at least initially, upon 
such stimulus characteristics as com- 
plexity, variability, and intensity, 
There is some support for this assump- 
tion. One series of studies has dem- 
onstrated, for example, that rhesus 


monkeys spend more time viewing 


varied, moving, or chromatic scenes 
than nonvaried, static, achromatic ones 
(Butler, 1961; Butler & Woolpy, 
1963). Several iny estigations have re- 
ported, in addition, that complexity is 
an important determinant of attention 
in young humans (e.g., Cantor, Cantor, 
& Ditrichs, 1963; Fantz, 1961; Spears, 
1964). 

Few attempts have been made, how- 
ever, to relate stimulus conspicuousness 
or prominence directly to the strength 
of subsequent attachments. In a pre- 
liminary investigation of this relation- 
ship, it was found that lambs tended 
to form attachments with respect to 
the salient inanimate objects (i.e., op- 
erating television sets) with which 
they had been individually confiried 
(Cairns, in press, Exp. 1). By the 
end of the 9th week of cohabitation, 
the lambs (a) became disrupted when 
the sets were removed, and (b) learned 
to turn in the appropriate direction to 
reach the inanimate object in a series 
of U-maze trials. Lambs exposed to 
dogs for the Same period, but debarred 
from physical interaction with them, 
developed relatively stronger attach- 
ments with respect to their cohabitants 
than did lambs paired with the inani- 
mate objects,4 

The above results a 
assumption that envir 
do not have to servi 
in order to become 
nents of the stimulus 
Port S’s behavior, 
stimuli may play a More important role 
in mammalian behavioral development 
than has heen gene 


rally conceded (e.g. 

Fuller & Clark, 19664. Meier, 1965; 
- Supplementary evidence relevant to the 

paral Weight-attachment assumption has 
een reported in recent studies of avian im- 

Printing. Bateson (1964b) found that con- 
spictousness of a stimulus is a significant 
determinant of whether chicks will approach 


1" also Bateson, 1964a, and Collins, 


lso support the 
onmental events 
e as manipulanda 
significant compo- 
patterns that sup- 
Distance receptor 
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Sackett, 1965). Walters and Parke 
(1965) review related evidence. 


Behavioral Disruption 


According to the present model, the 
removal of a previously conditioned 
subset of stimulus elements, or the in- 
troduction of an unconditioned subset, 
would be expected to interrupt ongoing 
behavior. The amount and duration 
of the disruption would reflect, in part, 
the stimulus weight of the subset and 
the number of response systems that 
have been conditioned to it. Stimulus 
subsets may refer, of course, to a 
variety of internal or external events, 
including animals, objects, or places. 

Several recent studies have been 
concerned with the effects of removal 
from the home environment on be- 
havioral organization. Scott and his 
collaborators (e.g., Elliot & Scott, 
1961; Fredericson, 1950; Ross, Scott, 
Cherner, & Denenberg, 1960; Scott & 
Bronson, 1964) have shown that the 
placement of a dog 3-12 weeks of age 
alone in an unfamiliar context is asso- 
ciated with heightened activity, yelp- 
ing, and general behavioral disturb- 
ance. The disruptive effects of re- 
moval are lessened if (a) the animal 
is placed with another dog in the un- 
familiar context, or (b) is left alone 
in the home compartment rather than 
alone in the unfamiliar setting. Simi- 
larly, removal of an interspecific or 
intraspecific cohabitant from the home 
compartment has been shown to be as- 
sociated with behavioral disruption in 
young lambs (Cairns & Tohnson, 1965; 
Hersher, et al, 1963b) and monkeys 
( eg. Jensen & Tolman, 1962). Be- 
havioral disturbance in the human in- 
fant upon separation from the mother 
has been observed in naturalistic 
(Schaffer & Emerson, 1964) and lab- 
oratory contexts (unpublished study 
conducted by Don E. Fleener in the 
writer's laboratory). 


The marked tendency for the ma- 
ternal animal to become disrupted when 
separated from her own or foster off- 
spring also has been reported. Study 
of attachment development in the pig- 
tailed monkey indicates that the amount 
of maternal disruption observed in- 
creases as a function of the mother's 
experience with the neonate (Jensen, 
1965). Parallel findings have been 
reported for interspecific attachment 
development in maternal ewes and 
does (Hersher, Richmond, & Moore, 
1963a). Moreover, the extent of dis- 
ruption shown by the subhuman pri- 
mate mother during separation is neg- 
atively correlated with the degree of 
sensory access that she has to the 
infant (Tensen & Tolman, 1962). 

There have been few attempts to 


^relate the extensity of behavioral dis- 


ruption of S to the tendency of S to 
reinstate the missing object, though 
such a relationship has been frequently 
assumed (e.g., Jensen, 1965). In one 
study of this contingency. behavioral 
disturbance upon cohabitant removal 
was found to be a good predictor of 
U-maze choice behavior (Cairns, in 
press). Lambs that were most dis- 
turbed upon separation from dog 
cohabitants tended to select these ob- 
jects rather than conspecific animals. 
Data are not yet available on the corre- 
sponding relationship in other mam- 
malian species. 


Effects of Continued Separation 


Short-term separation. The present 
model proposes that separation from an 
attachment-object initiates a relearning 
process. If this be the case, then length 
of the separation interval should be 
inversely related to the amount of be- 
havioral disruption observed. Where 
several different intervals of “social 
deprivation" (ie. hours since separa- 
tion from cohabitant) have been sam- 
pled, subjects (lambs) show consist- 
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ently less disruption aa + 
time (Cairns, in press, Exp. 2). (as 
reatest decrement in disnuption $ 
E ured by activity level and vocali- 
icem: occurred in the first 4 hours 
E separation, with a gradual unix 
thereafter toward preseparation exe!s 
of activity. A similar trend, but over 
a much shorter period (i.e., 10 min- 
utes) was found for young dogs which 
had been repeatedly placed in an un- 
familiar context (Elliot i & Scott, 

1961). If the period of isolation is 

extended over several hours, the dis- 

turbance functon obtained for 30 pups 
drawn from the same age-breed popula- 

tion studied by Elliot and Scott (1961) 

is quite similar to that observed in 

ungulates (unpublished data of the 
writer). £ 

One other line of evidence requires 
comment. The studies of Stanley and 
his collaborators (Bacon & Stanley, 

1963; Stanley & Elliot, 1962; Stanley, 

Morris, & Trattner, 1965) have been 

viewed as being pertinent to the prob- 

lem of social “satiation.” Dogs which 
have been exposed to the testing con- 
text, or to some significant portion of 
it (maze-human, or either maze or 
human separately), immediately prior 
to a test trial, tend to run slower than 
if preexposure had not occurred, Tt 

Seems reasonable to suggest, along the 
lines indicated by Stanley (1965), that 
the testing context itself changes in cue 
function, probably as a joint function 
of the number of repeated trials and 
the intertrial interval. Further study 
of the "satiation" phenomenon might 
include exploration of the effects of 
different intervals of prolonged habita- 
tion in the testing context (ie, a pro- 
cedure analogous to extended cohabi- 
tation). 

Extended separation. Few studies 
have been directly concerned with the 
effects of extended separation upon a 
previously formed attachment. The 


data that are available indicate that 
prolonged separation from an original 
cohabitant, and placement with a dif- 
ferent animal, leads to a reversal of 
the original preference in lambs (Cairns 
& Johnson, 1965; Cairns, in press, 
Exp. 3). Although the reversal oc- 
curred more rapidly when the “new” 
cohabitant was of the same species as 
S, the essential reversal occurred when 
the “new” O, was a dog (Cairns, in 
press). 

Studies of the postemergence attach- 
ment behavior of animals reared in iso- 
lation are also relevant. Although 
there have been several studies of the 
effects of isolation, few investigators 
have been concerned with the influence 
of subsequent experience on the attach- 
ment behavior of isolated animals. In 
addition, where Teports have appeared, 
the tendency has been to highlight the 
persisting effects of the treatment 
rather than the ameliorative influence 
of the postisolation experience, 

Many of the effects of isolation ap- 
pear, in fact, to be at least partially 
reversible (e.g. Fuller, 1966. Fuller & 
Clark, 1966h); Meier, 1965; Seay, Alex- 
ander, & Harlow, 1964). Ina review 
of primate Studies, Mason & Riopelle 
(1964) Suggest that the Severity of the 
consequences of social isolation is 
roughly Proportional to the length and 
amount of restriction, More work is 
called for on the mechanisms by which 
these effects are mediated, T, seems 
not unreasonable to Suggest, in this re- 
gard, that the context in which S 38 
placed may be partly responsible for 
the persistence of “asocial” responses. 
For example, a lamh which has been 


removed from the maternal ewe is re- 
Jected by aj] €wes, at least temporarily, 
When rej 


ntroduced to the flock. The 
Teported failur, 


€ of isolated ungulates 
to develop "norma]" conspecific rela- 
tions may he aS much an outcome of 
the flock’s Tesponse to S as the con- 
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verse (Hersher, Richmond, & Moore, 
1958; Scott, 1945). 

Punishment and separation. Social 
attachments are not necessarily in- 
hibited or extinguished by punishment. 
Attachments form even when S is bit- 
ten, mauled, or otherwise abused by O, 
(Cairns & Johnson, 1965; Rosenblum 
& Harlow, 1963; Seay, Alexander, & 
Harlow, 1964). These observations 
were made, however, under special 
conditions of environmental restriction : 
Neither S nor O, were permitted to 
escape, and alternative objects were not 
present. 

A different outcome is obtained un- 
der less restrictive circumstances. In 
naturalistic or semi-naturalistic set- 
tings, continued punishment is typically 
associated with physical separation and 
extinction of the social attachment 
(eg. Altmann, 1960). Whether pun- 
ishment is an antecedent or a conse- 
quent of attachment diminution unfor- 
tunately cannot be determined from 
field studies. 

. Other reports suggest that matura- 
tion events initiate chains of reciprocal 
responding which culminate in punish- 
ment and attachment extinction. If, 
for instance, suckling responses per- 
sist after a given stage of bone and 
musculature development, they pre- 
sumably are noxious and elicit maternal 
punishment and avoidance (Schneirla 
& Rosenblatt, 1961; Rheingold, 1963). 
Similarly, young dogs cohabited indi- 
vidually with an adult rabbit tend to 
sleep beside or atop him, and to spend 
most of the time that they are awake 
oriented toward him, for example, tug- 
ging his tail, mouthing his ears, 
scratching his neck. If the pups are 
young (28-32 days old), the rabbits 
remain passive and sometimes become 
involved in reciprocal grooming. By 
the time the dogs are 60-70 days old, 
however, the picture of peaceful co- 
existence is inverted. Responses of the 
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pups that were once harmless now in- 
flict serious physical damage and elicit 
vigorous avoidance and defensive bit- 
ing by the rabbit (unpublished study 
conducted by the writer and J. Wer- 
boff). 

Moreover, it would he inappropriate 
to focus only upon changes in the off- 
spring. Since the maturation of the 
young is usually synchronized with 
maternal physiological changes, it 
seems reasonable to assume tentatively 
that the occurrence of punishment is 
determined by modifications in the bio- 
physical status of both. 

Finally, punishment in itself would 
not be expected to diminish attachment 
strength. According to the present 
analysis, noxious stimulation would in- 
fluence mutual attachments only if the 
animals move or orient away from each 
other. Insofar as available data are 
pertinent, they seem consistent with 
the proposal. 


Organismic Variable 


2s 

Age of subject. Most of the work 
on attachment behavior has been con- 
ducted with young animals as subjects. 
But whatever mechanisms are respon- 
sible for the development of social at- 
tachment behavior in the young appar- 
ently are not entirely eliminated as the 
animal matures. Maternal females, at 
least, demonstrate a similar capacity 1n 
adulthood. Some of the factors that 
may facilitate mutual mother-young re- 
ciprocal conditioning have already been 
discussed: the physical or functional 
isolation of mother-young units from 
competing associations, the continuous 
involvement of one animal across a 
large number of the response systems 
of the other. In addition, certain age- 
related structural and physiological 
changes that might diminish attachment 
strength and preclude substitutive at- 
tachments have been noted. Other 
potentially effective variables, such as 
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" EE nges in 
matirakonal expeniential : oo 
sensory as py Bc explored in 
a yeo sr dec upon attachment 
terms ibaa extinction. While such 
nad changes doubtless ge 
the essential conditioning Longe 
would be premature to assign di e 
ential weights to the presumably ef- 
fective variables. That task can best 
await the accumulation ofa larger body 
of relevant data than is now available, 
On the other hand, the empirical 
analysis of the ontogenesis of attach- 
ment phenomena would likely be facili- 
tated by appropriate assumptions about 
the nature of the process itself. It is 
with respect to this matter that the 
working assumption of the present 
model can be underscored. All social 
attachments, regardless of the object 
(mother, peer, mate, offspring, ete.) or 
the age of S (infancy, juvenile, adult) 
are presumed to reflect a single proc- 
€ss; namely, the associative condition- 
ing of S’s response subsystems to the 
cues provided by Q's presence, 
Genetic variation, In view of the 
differences that obtain between the 
various mammalian orders in terms of 
structure, breeding and Tearing pat- 
terns, and presumed capabilities for the 
establishment of conditioned responses, 
it would be surprising if corresponding 
interspecific differences in attachment 
behavior could not be identified, 
Whether these differences can be 
shown to be a direct function of phylo- 
genetic rank is, of course, another mat. 
ter. Pertinent to this Issue are the 
findings of recent studies of “social 
reinforcement” phenomena in rodents, 
that is, where a mature S is permitted 
to view a conspecific animal if S per- 
forms a particular response. Relative 
to other outcomes, social reinforcement 
events provide only modest contro] Over 
the behavior of rodents (e.g., Allison 
& Rocha e Silva, 1965; Angermeier, 
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1962; Antonitis & Baron, 1964; An- 
tonitis & Kish, 1955). These results 
stand in sharp contrast to findings ob- 
tained in similar test arrangements with 
“higher” mammalian Species (e.g., un- 
gulates, primates), Without additional 
Comparative study, it is unclear whether 
the apparent discrepancy reflects the 
operation of genetic, 
condition variables. 
Cross-specific differ 
necessarily be in coni 
ent model. Its utilit 


may lie in the Specification of the mech- 


anisms whereby bets 'een-breed or be. 
tween-species differen 


age, or outcome 


ences would not 
ct with the pres- 
y in this regard 


Conciuping 


The task of this 
velop an integrated 
count of soci 
mammals, 
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lus setting in the control of behavior, 
it underscores the need for a detailed 
analysis of molar response patterns, 
on the one hand, and the factors in 
the stimulus setting that support these 
responses, on the other. The model 
thus forces attention onto the mecha- 
nisms by which behavioral stability is 
achieved, and the processes by which 
the effects of prior experience are me- 
diated. If a microanalysis of molar 
responses can clarify the amorphous 
construct of “general activity” (Bin- 
dra, 1961), it might be able to perform 
a similar service for the social be- 
havior rubrics of “dependency” and 
“attachment.” 
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TESTS FOR SPONTANEOUS ALTER 


ERIC W. HOLMAN* 


University of Pennsylvania 


Predictions about sequential properties of 2-choice learning data are 
derived from a simple assumption about the effects of rewarded and un- 
rewarded choices. These predictions are used as tests for spontaneous 


alternation in T-maze experiments. 


Given massed trials, rats show 


spontaneous alternation during position and reversal training as well 
as during training with both responses rewarded, but these rats are 
likely to show such alternation on a given trial if and only if they have 


alternated on the previous trial. 


Many theories of simple two-choice 
learning include the following as- 
sumption : The probability of choosing 
a particular response does not decrease 
when the response is chosen and re- 
warded, and does not increase when 
the response is chosen and not re- 
warded. This assumption, although 
generally consistent with the overall 
changes in behavior produced by 
various schedules of reward, is oc- 
casionally less successful in accounting 
for the detailed sequential properties 
of such behavior. Since most of these 
failures have been observed in rather 
specialized experiments, however, 
their importance is open to some 
question. Therefore, in order to 
determine more systematically the 
conditions under which the reward 
assumption is valid, it would be useful 
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to derive from the assumption detailed 
consequences that can be tested in 
standard experimental situations. 
One such consequence has been stated 
by Bush (1963, pp. 466-467); this 
test has been used to demonstrate the 
influence of the gambler's fallacy in 
binary prediction experiments. Other 
consequences are derived in the pres- 
ent paper; these tests turn out to be 
particularly sensitive to spontaneous 
alternation in T-maze experiments. 

Some response alternation is almost 
certain to occur in any learning proc- 
ess simply because of binomial varia- 
bility and perhaps also because of 
changes in response probabilities pro- 
duced by the reward schedule. Spon- 
taneous alternation, on the other 
hand, is generally attributed to some 
additional cause such as fatigue, bore- 
dom, or curiosity. Clearly, it is un- 
necessary to invoke any such addi- 
tional cause, however, unless simpler 
processes are proved to be insufficient. 
Hence, spontaneous alternation can 
be defined as alternation that cannot 
be explained in terms of the general 
reward assumption introduced in the 
previous paragraph. The present 
paper shows how much alternation is 
consistent with the assumption under 
various schedules of reward. 

In the derivation of these results, 
the following notation will be used. 
Experimental trials will be numbered 
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d. ome us en BES and y 
denote the two responses available to 
the subject (S). Since some results 
are derived which apply to any 
particular sequence of experimental 
events such as stimuli, responses, or 
outcomes, let 4 denote an arbitrary 
sequence of such events. Let the 
indicator random variable X, have 
the value 1 if the response r is chosen 
on Trial z, and the value 0 otherwise; 
similarly, let the random variable Y,, 
have the value 1 if the sequence of 
events / ends on Trial z, and the value 
0 otherwise. Finally, let Pn represent 
the probability that .S chooses re- 
sponse r on Trial n. 
Two sequential properties of re- 
sponse data will be considered: the 
repetition of responses and the length 
of response runs. To represent repe- 
tition of either response, let J, 
= XXn + (1 — X,)(1 — Xa); 
clearly, J» has the value 1 if the same 
response is chosen on Trials n and 
n + 1, and the value 0 otherwise. To 
represent repetition of the response v 
following the sequence of events ¢, the 
product ¥zsX,Xay1 can be used ; 
this product has the value 1 if / ends 
on Trial z — 1and r is chosen on Trials 
n and n + 1, and the value 0 other- 
wise. To represent short response 
runs, let K, = (1 —Xni)X (1 —Xagi) 
and K'a = X,4(1 — Xn) Xaa; for 
instance, K, has the value 1 if an 
r run lasting only one trial occurs 
on Trial n, and the value 0 other- 
wise. To represent longer runs, 
let La = X, a4X.(1 — X.) and d 
-(l—2X3)(-—3X3X.u for in- 
stance, L, has the value 1 if an 7 run 
lasting more than one trial ends on 
Trial z, and the value 0 otherwise. It 
Should be noted that these run ran- 
dom variables do not count runs that 
are terminated by the end of an ex- 
periment or short runs that occur on 
the first trial of an experiment. 
Clearly, the sum over trials of any 
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of these random variables represents 
the total number of times the cor- 
responding response repetition or re- 
sponse run occurs an experiment. To 
simplify notation, all such sums will 
be understood to run from n = N 
through x = N — 1. Predictions 
about the expected values of these 
sums will now be derived from various 
cases of the reward assumption. No 
predictions are made, however, about 
the variances of these sums, and it 
Seems unlikely that such predictions 
can be derived without stronger 
assumptions. This limitation is not 
crucial, however, because variances can 
be estimated from data if necessary. 


RESPONSE REPETITION 


The hypothesis of the first theorem 
requires that a choice of either re- 
sponse increases the expected proba- 
bility of choosing the same response 
again. According to the reward 
assumption, therefore, this theorem 
can be applied to experiments in 
which both responses are consistently 
rewarded. 


Theorem 1 
If for all pa with N’ &wizN-—1, 
E(biailps, Xn = 1) > Pn 
and 
E(nyi| Pn, X, =O} Ba, 
then 


Proof. For any pp, 
E(In| Pn) =ELX,X, #1 
PU Kua 
=P’ (Pngi| Pn, X421) 
+(1—p,)[1 
SE (Papil Pa, X,—0)] 
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The first five lines above are true by 
definition. The sixth line is derived 
by substitution of the hypotheses of 
the theorem in the third, fourth, and 
fifth lines. The last line follows be- 
cause the function p,2 + (1 — ppn)? has 
an absolute minimum equal to 1. 
Since this result holds for any pn, it 
is possible to take expectations over 
all p,; then, to complete the proof, it 
is necessary only to sum over 7 and 
reverse the order of summation and 
expectation. 

The conclusion of Theorem 1 states 
that during Trials NV’ through V — 1, 
the expected number of repetitive 
responses should be at least half the 
total number of trials. In an ex- 
periment in which both responses are 
rewarded, therefore, an insufficiency 
of repetitive responses would suggest 
that the reward assumption is in- 
validated by spontancous alternation. 
This theorem thus gives a rigorous 
justification of the conventional test 
lor spontaneous alternation in ex- 
periments with reward for both re- 
sponses. This test is not very sensi- 
tive, however, unless both responses 
are chosen about equally often; in- 
deed, if one response is chosen on at 
least three-quarters of the trials, that 
response must necessarily be repeated 
on at least half the trials, whether or 
not the reward assumption holds. 
Morcover, the test is not even justified 
in experiments without reward for 
both responses; in the absence of 
additional information about the dis- 
tribution of response probabilities and 
the form of the learning process, the 
reward assumption does not restrict the 
expected value of J, on trials on which 
one or both responses are unrewarded. 


GENERALIZED RESPONSE REPETITION 


The hypothesis of the next theorem 
generalizes the hypothesis of Theorem 
lin two respects. First, for only one 
of the responses is performance re- 


quired to increase expected response 
probability; thus, the theorem can be 
applied to experiments in which only 
one response is rewarded. Second, 
this increase in response probability 
is only required to occur if a particular 
sequence of events has ended on the 
previous trial; thus the theorem can 
be used to study the influence of 
previous events upon spontaneous 
alternation. 


Theorem 2 
If for all p, with VW’ € » € N — 1, 
E(bsilpbu Yr-i1X, = 1) > pa, 
then 
E(EY nX Xn) 5 E(ZY,aX,) 
E(ZYnXn) E(XY.a) ' 
Proof. By hypothesis, for any pn, 
E(X&Xial|fs, | ow = 1) 
= Prk (Pryl Pn, VnrXn = 1) 
Z Pr. 
Hence, if expectations are taken over 
all pa, with Y,_; equal to 1, 
E(XaXaşı| Yai = 1) 
> E(p.?| Yra = 1) 
> [E(pal| Yai 27 
This result can be substituted in the 
numerator of the second fraction in 
the derivation below; the rest of the 
derivation follows from well-known 


properties of expectations and a few 
obvious cancellations: 


BW a aX Xa) 
BVa 
E(Yn)E(XaXnn| Yai = 1) 
= — EQ-QEQG] Yam = 1) 
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Consequently, by cross-multiplication, 


NEY a-)E(Y iX aX a) 
= E Yaaa). 
This result can be substituted to give 


the third line below; the first two lines 
follow from the Schwartz inequality: 


EZE (Yn) ] EXE Q' X ,X,44)] 


> [ZVE(Y, )E(Y aX, Xn)! 
> [ZE(Y,.X,)]. 


To complete the proof, it is necessary 
only to cross-divide and reverse the 
order of summation and expectation. 

This theorem states that during 
Trials N’ through N — 1, the expected 
number of times the event ¢ is followed 
by at least two consecutive choices of 
the response z, divided by the expected 
number of times ¢ is followed by at 
least one choice of 7, must be at least 
as great as the expected number of 
times ¢ is followed by at least one 
choice of r, divided by the expected 
number of times / occurs. Thus, 
Theorem 2, like Theorem 1, places a 
lower bound on the expected number 
of repetitive choices, but the lower 
bound in Theorem 2 depends upon the 
relative frequency of the responses, 
This theorem can therefore be used to 
detect spontaneous alternation even 
if one response is chosen much more 
often than the other. 

A special case of Theorem 2 is of 
particular interest. If both responses 
are rewarded, the theorem can be 
applied to 7’ as well as to r; in order 
to eliminate the dependence upon 
previous events, / can be identified 
with some event that occurs on every 
trial. This case of the theorem leads 
to the following lower bound on the 
expected number of repetitive choices 
of either response: 


E(XJ,) E(X.) F 
N-wW-| NW 


EX.) F 
+ [1 yo ak 


This result thus justifies, in experi- 
ments with reward for both responses, 
the test suggested by Dember and 
Fowler (1958) for detecting spon- 
taneous alternation in spite of re- 
sponse biases. The above result is 
also clearly stronger than the con- 
clusion of Theorem 1. In other words, 


Theorem 2 contains previous tests as 
à special case. 


REsPONSE RUNS 


The hypothesis of the third theorem 
requires that the expected probability 
of choosing r increase on each trial, 
no matter which response was chosen 
on the previous trial. The specifica- 
tion of the previous response (that is, 
Xn-1) appears to be mathematically 
necessary but psychologically unim- 
portant. According to the reward 
assumption, this theorem can be 
applied to experiments in which one 
response is rewarded and the other 
response is not rewarded. 


Theorem 3 
If for all p, with N SSN 


E(ba|bu Xni) > p, 
for X, equal to 1 or 0, then 


EGL', — xK,)qo, 1] 


— 1, 


and 
E(ZL, = EK')«q—1, 0], 


where K,, K'a L, and I/ 
random variables defined 

Proof. By 
and X 


n are the run 
earlier. 


hypothesis, for any p, 
n—1; 


EQG — X, |, ed 


x: E(Pnsi| Pn, Aaa) r3 Pa 
20. 


i 
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Hence, if expectations are taken over 
all pa, for any Xna, 


E(Xngi — Xa Xn) > 0: [1] 


According to the definitions of the run 
random variables, 


L.—K.--X, a) —X:)Xzua 
—X.ü—2X54)] 
=(1 = Kani) (X na =X s) ; 
A Ly= Xp al(t—Xy)X nga 
—X.0—2X543)] 
—XoaQOCG4—X,); 
Ly EE Lun (Eua 1). 


It is next possible to take expectations 
in these results and then to substitute 
Equation 1: 


E(L'.— K,) 

=E=) 

E(X nt> XX, 420) 

20; [2] 
E(K',—L,) 

—E(X,) 

GGG X. X, a1) 
20; [3] 


EQ/,— K,)3M-E(K',—L,) 
—EX aX). 


ie the expectations in Equations 
and 3 are both nonnegative, neither 
can exceed their sum, Hence 


E(L's — Ks) € ER ma — x; [4] 
E(K's — La) £ E(Xny1 — X,). [5] 


To prove the first Statement in the 
conclusion of the theorem, it now 
suffices to sum Equations 2 and 4 Over 
n and reverse the order of summation 


and expectation: 
E[X(L' — K,)] > 0; 


E[S(L'n — Ka)] € EIZ(X.a — X,)] 
= E(Xy — X'y) 
1. 


I^ | 


The second statement in the conclu- 
sion can be derived from Equations 
3 and 5 in a similar manner. 

The conclusion of Theorem 3 states 
that during Trials N’ through N — 1, 
the expected number of long runs of 
one response should not differ by more 
than 1 from the expected number of 
short runs of the other response. 
This conclusion implies, moreover, 
that the expected total number of long 
runs should not differ by more than 1 
from the expected total number of 
short runs. In a simple position- 
training experiment, therefore, a pre- 
dominance of short runs would in- 
dicate that the reward assumption is 
invalidated by spontaneous alterna- 
tion, and a predominance of long runs 
would indicate that the reward as- 
sumption is invalidated by spontane- 
ous perseveration. Thus, Theorem 3, 
unlike the previous theorems, can be 
used to study perseveration as well 
as alternation. For instance, this 
theorem could be used to determine 
whether the increases in perseverative 
behavior observed to follow certain 
brain lesions and drug treatments 
actually represent spontaneous per- 
severation not explainable by simple 
reward effects. At present, however, 
data suitable for this purpose do not 
seem to be available. 


EXPERIMENTAL EVIDENCE 


Most of the previous experiments 
on spontaneous alternation have al- 
ready been reviewed by Dember and 
Fowler (1958). The criterion es- 
tablished in Theorem 1 has frequently 
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been used as a test for spontaneous 
alternation in experiments in which 
both responses were rewarded and 
also, incorrectly, in experiments in 
which neither response was rewarded. 
Fortunately, the presence or absence 
of reward does not appear to have 
made much difierence in these studies. 
The variable that seems most strongly 
to have influenced alternation is 
the distribution of trials. Heathers 
(1940), Montgomery (1951), Riley 
and Shapiro (1952), and Walker 
(1956) have all reported that sponta- 
neous alternation occurs with massed 
trials, but not with spaced trials. 
‘Theorems 2 and 3 permit the study 
of spontaneous alternation to be 
pursued in greater detail and extended 
to experiments without reward for 
both responses. To illustrate the use 
of these theorems, a new series of 
experiments was run, and a new 
analysis was performed on data previ- 
ously reported by Galanter and Bush 
(1959) and Fey (1960, 1961). In the 
new experiments and also in the 
Galanter-Bush and Fey experiments, 
rats were run in a T maze with food 
reward. In some of the new experi- 
ments, the general conditions of 
previous studies were reproduced by 
giving reward for both responses with 
massed trials. In the rest of the new 
experiments, these conditions were 
changed by giving reward for only one 
response with massed trials. In the 
Galanter-Bush and Fey experiments, 
the conditions were further changed 
by giving reward for one response 
with spaced trials. 


METHOD 


The new experiments will now be described 
in greater detail. The subjects (Ss) were 27 
female Sherman-strain albino rats, obtained 
at the age of 90 days from Scott Farms, 
Stroudsburg, Pennsylvania. of these rats, 
4 were discarded early in training for fail- 
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ure to run the maze properly; the rest of this 
report concerns the remaining 23 Ss. 

The apparatus was an automatic T maze 
in which a rat could run for long sessions of 
consecutive trials without the aid of the 
experimenter (E). The stem section of this 
maze was 40 inches long, the arm sections 
were at right angles to the stem and 24 inches 
long, the goal sections were parallel to the 
stem and 24 inches long, and the sections 
through which the rat could return from the 
goal to the stem were at acute angles to the 
stem and 32 inches long. All sections were 
4 inches wide and 5 inches deep. The walls 
were of j-inch plywood, painted flat black, 
and the floor and ceiling were of hardware 
Cloth. Sections were separated by guillotine 
doors; 12 inches beyond each door was a 
photobeam. When the rat passed through 
the photobeam, the door ahead was raised 
by a solenoid, and the door behind was 
dropped; the raising of the door could be 
delayed by a timer. A drinking tube pro- 
jected into the stem section Írom a water 
bottle resting on top. A food cup was 
recessed into the wall of each goal section. 
When the rat passed through the photobeam 
in the goal section, a 45-milligram Noyes 
pellet could be dropped into the cup as a 
reward. The schedule of rewards was 
programmed in advance, and the sequence 
of choices made by S was recorded auto- 
matically. 

The maze, control apparatus, and Ss’ 
individual home cages were all kept in the 
same room. The room was partially sound- 
proof, continuously illuminated, and main- 
tained thermostatically at 75? F. Water was 
always available to Ss. Food, consisting of 
Purina lab chow and the Noyes pellets 


earned in the maze, was limited to 10 grams 
a day. 


After 3-6 weeks of adaptation to handling 
and food deprivation, each .S received a short 
program of manual pretraining. The rat was 
first given six Noyes pellets in its home cage, 
then placed in the goal section of the maze 
and rewarded, five times on each side, and 
finally placed in the arm section and rewarded 
in the goal section, five times on each side. 
The first experimental session immediately 
followed pretraining on the same day; later 
sessions were run at approximately the same 
time on consecutive days. The rat was placed 
in the stem section of the maze to start the 
first trial of a session and was removed from 
the return section after the last trial. If the 
rat failed to complete any trial within 5 
minutes, the solenoids raising the doors were 
all automatically turned off to prevent over- 
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heating; E entered the room and restored 
the power when the solenoids were cool. Such 
events occurred in less than one-sixth of the 
sessions; otherwise, E did not enter the room 
during a session. 

Two groups of Ss were given reward for 
both responses. In Group 1, Ss were not 
delayed in any section of the maze; in Group 
2, 5s were delayed for 10 seconds in the stem 
section on each trial. Sessions contained 220 
trials, because 220 pellets exactly completed 
a 10-gram ration. Groups 1 and 2 contained 
live and four rats, respectively. Except for 
two Ss in Group 1 who were run for seven 
sessions, all Ss were run for at least eight 
sessions. As the extra sessions did not reveal 
any changes in behavior, only the first eight 
sessions for each .S will be considered hence- 
forth. 

Two other groups were given position and 
reversal training. In Group 3, Ss were not 
delayed in any section; in Group 4, Ss were 
delayed for 10 seconds in the stem section on 
each trial. Sessions contained 200 trials, 
divided into four 50-trial training periods. 
Reward was given on one side for the first 
training period, on the opposite side for the 
next period, and so on; half the Ss were first 
rewarded on the right, and half on the left. 
h group contained seven rats, all of which 
were run for four sessions. 

The Galanter-Bush and Fey experiments 
have already been described in the original 
publications; the design of these experiments 
will be summarized here. Only the groups 
given conventional position and reversal 
training will be considered. In Groups 3 and 
+ of the Galanter-Bush experiment, Ss were 
given three trials a day, separated by 30 
minutes, for a total of 192 trials. The 20 
rats in Group 3 were rewarded on the right 
for the first 48-trial period, on the left for the 
next 48-trial period, and so on; the 19 rats in 
Group 4 were rewarded on the right for a 
144-trial period, and on the left for a 48-trial 
Period. In Group 1 of the Fev experiment, 
Ss were given one trial a day for a total of 
65 trials. The 25 rats in this group were 
rewarded on the right for a 30-trial period, 
and on the left for a 35-trial period. 


RESULTS 


Of Ss given reward for both re- 
sponses in the new experiments, 
nearly all developed fairly consistent 
biases toward one side or the other, 
although no S chose the same side on 
more than nine-tenths of the trials in 


any day. The overall percentages of 
choices of the left side for each S were 
as follows: in Group 1—30, 37, 50, 51, 
71; in Group 2—26, 32, 41, 66. 

Of Ss given position and reversal 
training in the new experiments, all 
learned to choose the correct side on 
most trials toward the end of each 
50-trial training period. Not sur- 
prisingly, Group 4, given a short 
delay between trials, appeared to 
learn slightly faster than Group 3. 
As in previous experiments on re- 
peated reversal learning, both groups 
tended to learn faster in later training 
periods than in earlier periods. Com- 
pared to the groups given similar 
training in the Galanter-Bush and 
Fey experiments, Groups 3 and 4 
appeared to learn more slowly in the 
first daily session, at about the same 
rate in the second session, and faster 
in the last two sessions. 

The prediction stated in Theorem 2 
can be tested in the new experiments 
and also in the Galanter-Bush and 
Fey experiments. According to the 
hypothesis of the theorem, the re- 
sponse 7 can be identified with either 
a left turn or a right turn in Groups 
1 and 2, but must be identified with 
the correct response in the other 
groups; to investigate the influence of 
the previous response, the event 7 can 
be identified with either response. 
Each row of Table 1 presents tests of 
the prediction for a single group with 
a single identification of the response 
r. 'The Galanter-Bush and Fey ex- 
periments are denoted by obvious 
abbreviations, and the response identi- 
fied with 7 is indicated in parentheses 
for Groups 1 and 2. In the tests 
labeled “Previous response same, t 
is identified with the same response 
as r; in the tests labeled "Previous 
response different," f is identified with 
the opposite response. Each test is a 


i r ios of sums; 
comparison of two ratios 
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TABLE 1 
TESTS OF THE PREDICTION IN THEOREM 2 
Previous response same Previous response different 

ines Group tests* | S tests Block tests’ | Group tests» S testsb Block tests» 

60 > 36 25-8 | 11-0 23 « 70 1-37 0-11 
i2 57 > 30 22-7 | 11-0 19 < 64 0-38 0-11 
2(R) 67 > 58 13-17 | 10-1 51 « 60 6-26 0-11 
2(L) 62 > 40 16-12 | 11-0 26 < 42 1-31 0-11 
3 92 >75 28-0 8-1 48 < 71 5-23 1-10 
4 91 >85 27-1 9-0 58 < 60 12-16 1-10 
GB3 90 > 84 20-0 8-1 62 > 53 14-6 7-4 
GB4 95 > 92 19-0 4-1 62 > 35 19-0 5-0 
F1 90 >84 | 241 | 4 63 > 39 22-2 4-1 


n Theorem 2 predicts that the second figure will not exceed the first. " " > 
ts confirming the prediction, and the second figure is the number of tests 


b The first figure is the number of tes 
contradicting the prediction. 


three sets of tests, differing in range of 
summation, are presented. In the 
Group tests, the sums are taken over 
all Ss and days for each group. The 
table states the ratios of these sums, 
expressed as percentages; the theorem 
predicts that the first percentage will 
be at least as large as the second. In 
the S tests, the sums are taken over 
each daily session for each S in the 
new experiments, and over all days 
for each S in the Galanter-Bush and 
Fey experiments. The table states 
the number of tests confirming and 
contradicting the prediction, not 
counting tests in which the denomi- 
nator of one of the ratios is less than 5. 
In the Block tests, the sums are taken 
over all Ss, and also over all days in 
the new experiments, for each of the 
following blocks of trials. In the 
groups given reward for both re- 
sponses, a session is divided into 10 
blocks of 20 trials followed by a block 
of 18 trials. In the groups given 
position and reversal training, the 
blocks are chosen to separate three 
different stages of training: early, 
when the group proportion of correct 
responses is less than about 35; 
middle, when the proportion correct 
is between .35 and .85; and late, when 


the proportion correct is greater than 
-85. Using these criteria, the first 
training period in a session is divided 
into a block of 20 trials, and then a 
block containing the rest of the period ; 
except in Group 4 of the Galanter- 
Bush experiment, each later training 
period is divided into a block of 5 
trials, then a block of 20 trials, and 
then the rest of the period; in the 
Galanter-Bush Group 4, the second 
period is divided into a block of 20 
trials, then a block of 15 trials, and 
then the rest of the period. The table 
again states the numbers of tests con- 
firming and contradicting the predic- 
tion, not counting tests in which the 
denominator of one of the ratios is 
less than 5. 

The Group tests show a fairly clear 
pattern of results. In the Galanter- 
Bush and Fey experiments, the pre- 
diction is confirmed in all tests, but 
in the new experiments, the prë- 
diction is confirmed only when ¢ and r 
are identified with the same response 
and is contradicted when t and r are 
identified with different responses. 
The S and Block tests generally 
although not unanimously corroborate 
these results. Most of the exceptions 
occur in the S tests: in Group 2 with 
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t the same as 7, and in Group 4 with 
t different from r. 

The prediction stated in Theorem 3 
can be tested in all the groups given 
position and reversal training. The 
response r is always identified with 
the correct response. Each row of 
Table 2 presents tests of this predic- 
tion for a single group. Each test is a 
comparison of the mean number of 
long runs (of length greater than 1) 
per training period with the mean 
number of short runs (of length 1) 
per training period; two sets of tests, 
differing in range of averaging, are 
presented. In the Group tests, the 
means are taken over all Ss and days 
for each group. The table states the 
mean number of long runs followed by 
the mean number of short runs; the 
means are indicated as equal if they 
do not differ by more than 1, as 
predicted in the theorem. In the S 
tests, the means are taken over each 
daily session for each S in the new 
experiments and over all days for 
each S in the Galanter-Bush and Fey 
experiments. The table states the 
numbers of tests showing more long 
runs than predicted, run lengths 
consistent with the prediction, and 
more short runs than predicted. 
Block tests are not presented because 
such tests are rendered relatively in- 
sensitive by the small numbers of 


TABLE 2 
TESTS or THE PREDICTION IN THEOREM 3 


Group Group tests» S testsb 
3 5.7 < 12.2 0- 3-25 
4 9 1- 7-20 
GB3 5.4 3- 7-10 
GB4 6.1 6- 7-6 
F1 3.9 9-12- 4 


* Theorem 3 predicts that the two figuri vi 
indicated as equal. amb eee wil Be 
b The first figure is the number of tests showing m 
long runs than predicted, the second figure is the DUBE 
of tests confirming the prediction, and the third figure 
is the number of tests showing more short runs than 

predicted. 


435 


runs that occur during short blocks 
of trials. 

The Group tests again follow a 
fairly clear pattern. In the Galanter- 
Bush and Fey experiments, the pre- 
diction is confirmed as before, but in 
the new experiments, too many short 
runs are observed. The S tests again 
generally corroborate these results. 
As before, more exceptions occur in 
Group 4 than in Group 3. 


Discussion 


The foregoing analysis indicates 
that spontaneous alternation is more 
prevalent than has previously been 
demonstrated. Using Theorems 2 
and 3, spontaneous alternation was 
detected in the new experiments al- 
though not in the Galanter-Bush or 
Fey experiments. Hence, spontane- 
ous alternation requires massed trials 
but not reward for both responses. 
In other words, given massed trials, 
Ss show spontaneous alternation not 
only when they have no reason not to 
alternate, but also when they must 
learn to repeat a correct response. 
This fact thus suggests that the 
original reward assumption is gen- 
érally inapplicable in massed-trial 
T-maze experiments, and that such 
experiments are consequently not 
very useful for the detailed study of 
simple reward effects. A 

The more detailed tests also in- 
dicate that spontaneous alternation 
may depend upon previous responses. 
Using Theorem 2, spontaneous alter- 
nation was generally detected only 
when £ and r were identified with 
different responses. Hence, Ss are 
likely to show spontaneous alternation 
onagiven trialif, and only if, they have 
alternated on the previous trial. Un- 
fortunately, this finding could be an 
artifact caused simply by fluctuations 
in the overall frequency of alternation. 
Although such a possibility cannot in 
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principle be completely eliminated, 
the S and Block tests suggest at least 
that the observed results are not 
attributable to differences between 
Ss, days, parts of a day, or stages of 
training. It therefore seems reason- 
able to conclude that spontaneous 
alternation reduces the probability of 
a repetitive response at the beginning 
of a response run, but generally does 
not further reduce this probability 
if the run persists. In position and 
reversal training, therefore, such alter- 
nation may be expected to retard but 
not to prevent learning to repeat the 
correct response. This fact thus 
explains why the original reward 
assumption may be plausible in the 
long run although incorrect in detail. 
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PROCESS OF ENUMERATION* 


MARY BECKWITH anp FRANK RESTLE 


Indiana University 


Mathematically, the process of enumeration is fundamental to arith- 
metic. Psychologically, it is a sensorimotor chain controlled at every 
stage by a shifting perceptual organization. Enumeration requires a 
chant (“1, 2, 3... ."), a shifting indicator response (pointing), and 
a perceptual grouping of objects into those already counted and those 
still ahead. The arrangement of the objects has, theoretically, an 
important effect on the speed and accuracy of enumeration. Further 
analysis shows that the serial chain of behavior, required for counting 
a fairly large set of objects, must be divided into parts, and the objects 
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grouped into corresponding subsets. 
ship between arrangement of objects and counting. 


At the foundation of applied arith- 
metic lies the ability to enumerate sets. 
A basic procedure in counting is to 
chant the numerals, "ONE, TWO, THREE 

->° at each word, pointing to one of 
the objects in the set to be counted. 
The chain of indicator responses is often 
Senerated from a basic response like 
pointing, by a modification like moving 
to the right. Then, if I is the indicator 
response and T is its transformation 
to get to the next object, the responses 
are I, T(I), T[T(I)],..., TECE): 
The indicator responses must touch 
every object once and stop when all 
the objects have been counted. This 
requires perceptual control, a sliding 
discrimination between the set counted 
(C) and the set as yet uncounted (U). 
As objects are shifted from C to U it 
is seen that at any stage n, Cna 2 C,, 
whereas U,4 C Up. The sequence of 
sets constitutes a linear array (Restle, 
1959b, 1961), is ordered, and provides 
a basis for simple measurement of dis- 
tance. Finally, the end of the process 
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3 experiments show the relation- 


of enumeration must be recognized, 
usually by the emptiness of U. We 
shall refer to the above process as 
"simple enumeration." 

Young children may fail to produce 
the chant correctly, but older subjects 
(Ss) presumably have difficulty mainly 
in the sensorimotor components of 
counting. Some of the factors in 
sensorimotor control of counting are 
seen when the experimenter (E) varies 
the arrangement of the objects to be 
counted. In simple enumeration, if 
the objects lie in a straight line, then 
the transformation of the pointing 
response T is merely a move to ( say) 
the right, and the discrimination be- 
tween C and U can be made relative 
to the location of the finger. If the 
objects are in a circle, the transforma- 
tion may again be a simple shift in 
the clockwise direction, but the dis- 
crimination between C and U becomes 
more difficult as S approaches the 
origin of his counting. This may lead 
to errors and perhaps to long delays 
if S starts over. When the objects to 
be counted are all alike, the circular 
arrangement should be very difficult, 
though the straight line is not affected. 


If the objects differ among themselves, 


say in shape, then the circular arrange- 
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ment should become easier. With a 
rectangular arrangement, again there 
is a simple path, and there should be 
no difficulty with the stopping rule. 
In simple enumeration the rectangular 
arrangement has no great advantages 
over the straight line, and, since it 
provides some optional paths, may 
even introduce delays. Finally, the 
objects may be scrambled—approxi- 
mately equally spaced but in no par- 
ticular pattern. This forces S to de- 
vise a path through the set; he must 
use a complex sequence of transforma- 
tions of the indicator responses, and, 
since no single cue distinguishes ele- 
ments of C from U, S must construct 
and remember cues as he goes along. 

Simple enumeration is not a com- 
plete psychological description of the 
process of enumeration because it takes 
no account of the fact that assemblies 
of discrete objects are perceived in 
groups. Of Wertheimer’s six prin- 
ciples (Woodworth & Schlosberg, 
1954, p. 409) the authors’ experiments 
permit variation of proximity, simi- 
larity, and good continuation. The Ss 
presumably perceive the field of objects 
as organized into groups. A natural 
question is whether these groupings 
have any function in counting. 

Enumerating as many as 16 objects 
is a long serial task, and § may organ- 
ize the segments into a convenient 
number of smaller subtasks. If so, 
then it is necessary to group objects 
conveniently to permit rapid and accu- 
rate counting; for example, if Ss need 
to group objects, the rectangular ar- 
rangement may be consistently better 
than the straight line. 


History or THE PROBLEM 


Although counting is basic to applied 
arithmetic, there is little literature of 
experimental studies dealing with 
counting. Many studies have dealt 
with the immediate apprehension of 
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number—a process supposed not to be 
mediated by steps, but to occur in- 
stantly. Freeman (1912) set the span 
of apprehension of circles of light at 
5 for adults and 4 for children over 
8 years; younger children were untest- 
able. Fernberger (1921) used random 
arrangements of 4—12 dots and found 
that the relative frequencies of correct 
responses made a continuous ogival 
function of number of dots shown. 
Correct judgments were made 5076 
of the time (Fernberger's measure of 
visual discrimination) in the range 
from 6 to 11 dots. Hunter and Sigler 
(1950) varied intensity and duration 
of presentation of dots and found that 
the usual visual laws applied. 

One approach to the span-of-appre- 
hension idea is as detection of a num- 
ber of targets (Casperson & Schlos- 
berg, 1950; Restle, Rae, & Kiesler, 
1961; Schlosberg, 1948). This ap- 
proach deals mainly with perception 
of a number of dots under short ex- 
posures or poor illumination, but may 
undermine the concept of a “magical 
number.” 

Warren (1897) arranged white 
squares in a vertical line on a black 
background, measured the time taken 
to count them, and concluded that 4 
was the limit of perceptual counting. 
Bourdon (1908) also used the method 
of reaction time and found a small 
increase in time as a function of num- 
ber of dots to be counted. He judged 
that 7 was the limit of the number that 
could be accurately ascertained at one 
glance. Saltzman and Garner (1948) 
used concentric rings as targets and 
decided that the method of reaction 
time was useful for the study of per- 
ception of number. 

Von Szeliski (1924) found that 
fields of up to 6 dots or figures could 
be perceived without eye movement. 
Jensen, Reese, and Reese (1950) used 
fields of up to 29 dots and found that 
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errors appeared first in ll-dot fields 
and became more frequent in larger 
fields. After 6 dots, the plot of reac- 
tion time against number of dots be- 
comes more scattered, and a more 
shallow, positively accelerated curve 
appears. From this, Jensen et al. con- 
cluded that Ss subitize up to 5 or 6 
dots. 

Taves (1941) studied judgments of 
numerousness (considered a stimulus 
variable like loudness) and stated that 
two mechanisms were involved—one 
used for up to 7 dots, the other for 
larger fields. He found that judged 
numerousness was decreased if the 
dots were arranged into a single figure, 
for example, a circle. Kaufman, Lord, 
Reese, and Volkmann (1949) also sug- 
gested two separate discriminatory 
mechanisms—one acting for up to 6 
dots, the other for larger collections. 

All of these studies concentrate upon 
rapid apprehension or judgment of 
number and are cast in a theoretical 
framework drawn from perception. 
Much effort has gone into determining 
the number of things that can be ap- 
prehended at a single glance, even 
though neither civilized adults nor 
civilized children count at a glance. 

In summary, the two main methods 
of studying enumeration are the tachis- 
toscopic, in which a display is briefly 
displayed and judged, and the reaction- 
time experiment, in which the material 
is displayed continuously, but S re- 
Sponds as quickly as possible. Both 
methods are perceptual in character. 
though the reaction-time method per- 
mits some more complex methods of 
enumeration to occur. The main ques- 
tion has been the maximum number 
of dots that can be perceived at a 
glance. This number has been esti- 
mated from about 4 to 7 and clearly de- 
pends upon Ss and the display used. 

It is amazing that although everyone 
knows that objects are enumerated by 


counting, most studies of enumeration 
or the judgment of number have at- 
tempted to rule out counting and en- 
sure that only the primitive method 
of guessing is employed. Classical 
psychological theory assumes that per- 
ception is a basic and simple process 
from which other, more complex cog- 
nitions are to be derived ; furthermore, 
the task of experimental psychology is 
to analyze compound experiences into 
their elements. Following these tenets, 
the process of counting is reduced to 
its elements, "perception" of numbers, 
and no effort is expended in synthesiz- 
ing the process oí counting. 

The experiments reported below 
have a different concept of enumera- 
tion. First, it is assumed that enu- 
meration is a compound task, but that 
the true and interesting structure of 
the task is not iound by analysis into 
bits, but by determining how the parts 
go together. Second, it is assumed 
that perception of small numbers may 
be a skill developed by adults, a sort 
of shortcut to counting, rather than 
an elementary mental event. Through- 
out, the authors will use the reaction- 
time method. The S is instructed to 
enumerate as quickly as possible. 


EXPERIMENTS I AND II 
Method 


Subject. The first of these experiments 
used 18 students in an ungraded class in the 
summer of 1964, 10 boys and 8 girls, ranging 
in age from 7 years, 3 months to 9 years, 
10 months. The second experiment used 20 
college students, ages 17-22, drawn from the 
elementary psychology course at Indiana 
University. $ 

Apparatus and procedure. The stimulus 
cards were 83 X 11-inch sheets of white paper 
on which objects to be counted were out- 
lined in black ink. Each S was run 
separately; each sat across a table from E, 
in an otherwise unused classroom which was 
arranged to reduce external visual distrac- 
uw card was presented flat on the table, 
first shielded with an aluminum panel on a 
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i ised at each trial to 
tonde wa petis a standard timer was 
display cicnultaneously. Response and la- 
engaged S e were recorded. Time was 
teney OF TET i d eration 
spent gaining the attention and coop! 
of each child whereas somewhat more 
£ormal instructions were given to the college 

dents. 

à cm design. Each S counted the 
objects on 40 stimulus cards which were 
presented in a different random order for 
each S. Eight of these were fillers inserted 
to make the repeated appearance of certain 
numbers less obvious. 

Variables were (a) number of objects to 
be counted, (b) arrangement of objects—line, 
circle, rectangle, scrambled, (c) all objects 
the same shape or different shapes. 

Because of the rectangular arrangements, 
numbers with two fairly equal factors were 
employed—12, 15, 16, and 18. Filler cards 
had 5, 9, 10, 11, 13, 14, or 17 objects. Ob- 
jects were spaced about 4 inch apart, with 

the straight line being horizontal, and the 
rectangles being 4x3, 5X3, 4X4, and 
6X3. Objects on a card were either all 
the same shape or divided equally among 
squares, circles, isosceles triangles, and tear- 
drop shapes. The circle's diameter was 3 
inch; all other figures were of approximately 
equal apparent size. Thus, 4 Numbers X 4 
Shapes X Same or Different yields 32 dis- 


plays in a simple factorial design, replicated 
once per S. 


Results 


Most children employed 


r pointing 
and chanting, 


though some counted 
silently and sometimes without point- 


ing, while college students counted 
silently, occasionally pointing. All 
children counted all displays, making 
106 errors out of 576 counts (18.4% 
error) with a mean time of .60 second 
per object. The college students aver- 
aged .30 second per object and made 
only 11.6% errors. 

Larger sets of objects took longer 
to count for both groups. Linear equa- 
tions fit both sets of data well, being 
t= 75(N — 3) for the children and 
t= .35(N—2) for the college stu- 
dents, where £ is a mean reaction time 
in seconds, and N is the number of 
objects to be counted. Since no meas- 


urements were taken with small num- 
bers of objects, there is no reason to 
believe the straight line may be ex- 
trapolated back without curving (see 
Jensen et al., 1950). 

Object arrangement influences both 
groups' response latencies, with the 
children counting the Rectangle most 
rapidly, then the Line, the Circle, and 
the Scrambled arrangement, in that 
order. The proportions of errors for 
the children were: Rectangle, .097; 
Line, .104; Circle, .215; and Scram- 
bled, .326, thus showing the Scrambled 
arrangement most difficult both in time 
and errors. For college students, there 
was little difference between Line, Cir- 
cle, and Scrambled, but the Rectangle 
was much faster. 


Discussion 


The results seem to show a differ- 
ence between children's and college 
students’ use of spatial arrangement. 
Children may use spatial arrangement 
to get a simple ordering of the stimuli, 
while college students seem to make 
special use of the rectangular array, 
presumably by using multiplication. 
The children may not multiply faster 
than they enumerate or they may not 
have learned to apply the concept of 
multiplication to this situation. 

When they cannot multiply, college 
students and perhaps children may di- 
vide the set of objects, possibly by 
perceptual principles of grouping, into 
smaller subsets, enumerate the subsets, 
then add the numbers, Sets of up to 
5 or 6 may be subitized by skilled adult 
Ss. Calculation Speed should depend 
Upon convenience of grouping, enu- 
meration, and addition. Rectangular 
arrays enforce a convenient subdivision 
Which permits Several simultancous 


€numerations. Indeed, both groups 
consistently count the rectangular ar- 
ray rapidly, 


but for the college students 
the array of 16 Objects is especially 
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TABLE 1 


MEAN Time to Count (IN SEC.) BY YOUNG CHILDREN 


Arrangement of objects 


No. 
ob- Line Circle Rectangle Scrambled 
jects 

Same | Error| Diff. | Error| Same | Error| Diff. Eod Same | Error| Diff. | Error| Same | Error| Diff. | Error 
12 6.58 1 6.36 L 8.05 $ 1 1 5.62 1 8.77 4 3 
15 9,28 2 8.45 0 9.45 3 3 2 8.32 0 9.98 7 6 
16 9.11 3 8.63 3 9.04 5 4 2 8.68 3 10.49 6 bi 
18 | iit 3 8.04 2 10.94 4 | 6 4 9.52 1 12.36 rd 6 

Note.—Proportion of errors is as follows: Line, .104; Circle, .215; Rectangle, .097 ; Scrambled, .326. 
fast. This array was 4 X 4, and previ- EXPERIMENT III 
ous informal observations have shown i 
In the first two experiments, only 


that older children, and presumably 
college students, are faster at squaring 
than at other multiplications. Sets of 
16 are not counted unusually rapidly 
in other arrangements, or in any ar- 
rangement by young children, so there 
1s reason to believe that the advantage 
lies in multiplication. 

It may also be noticed that rectangu- 
lar arrays are counted more rapidly if 
all the objects are the same, whereas 
with circular and linear arrangements 
counting is faster with different ob- 
jects. Rectangular arrangement en- 
forces a clear grouping on the set; 
differentiation of figures only competes 
with this grouping. Spatial grouping 
is more difficult with straight lines and 
circles, and Ss may use dissimilarities 
among figures to provide the necessary 
groupings. 


position and shape of the objects were 
varied. In the third experiment, 
color variation is also used so as to 
establish alternative perceptual group- 
ings. 

The general design of this experi- 
ment is based upon earlier studies of 
additivity of cues in cue learning 
(Bourne & Restle, 1959; Bower & 
Trabasso, 1963; Restle, 1955, 1959a, 
1962; Trabasso, 1960, 1963). In such 
experiments several problems are con- 
structed using two or more dimen- 
sions. One group may learn to classify 
drawings based on their shape, disre- 
garding color. Another group may 
classify drawings using color, disre- 
garding shape. A third group learns 
a classification in which color and 
shape are redundant, which is easier 
than the other two (additivity of cues). 


TABLE 2 


Mean Time TO Count (ix SEC.) By COLLEGE STUD) 


ENTS 


Arrangement of objects 


No. ; — RD 

objects Line Circle -— Scrambled 

Samel Errer| Dit. | Ecror| Samel x e sa ensi s 

E e| Error} Difi. | Error! Same| Error| Diff. | Error] Samel Error] Ditt. | Error] Same] Error) Dif. | Error 
12 |392| 0 o |421| 2 3.55| 2 
15 1528 | 4 3 |ss1| 4 * re. aor | 2 
16 5.59 | 4 2 |599| 1 1 2m ? $45| 4 
is |607| 2 0 |653| 3 DEAE 613| 5 


442 Mary BECKWITH AND FRANK hESTLE 


TABLE 3 


Mean Time TO Count (IN sEC.), EXPERIMENT III 


Conditions 
CM gem mre: | Shape | Ambiguous | Random 
objects | | ae p e 
“gine | mrs | dime | Ernes | Time | Errors | Errors | Errors | Time | Errors 
|- i 21 | | | | 380 | 3 
| | | 439 | 2 380 | 3 
| dig | 4 347 1 à | ae | 2 3 
ia | "E IE: 452 | 5 | | 2 | 476 | 5 5.16 4 
16 | 455 | 0 4.32 PO 556 | 4 aar] 2 5.21 | 0 
17 | 4.82 | 9 | 544] 1 | 693 | 5 | 38 | 2 | om} s 
18 | $39 | 7 | 596 | 4 | 6.40 | 6 |593| 1 | T12 | 6 


Several mathematical models of cue 
learning have given quantitative pre- 
dictions of learning speed based on the 
number of redundant relevant cues, 
the number of irrelevant cues, and the 
weights and strengths of such cues. 


Method 


Subjects. Twenty-four college students, 
aged between 17 and 20, from an introductory 
psychology class served as Ss. 

Apparatus and procedure. 
identical with Experiment II. 

Materials and design. Thirty displays, each 
consisting of a collection of shapes cut from 
construction paper and pasted to 83 X 11-inch 
sheets of white paper were used. The shapes 
were four colors: brick red, Air Force blue, 
light blue-green, and light violet, They were 
four shapes: circle, Square, triangle, and 
parallelogram. Test items had 12, 15, 16, 17, 
and 18 figures, and 5 filler presentations. In 
Condition 1, Color and Shape, the four colors 
and shapes were segregated, so that, for 
example, one section of the figure was all 
blue squares, another part red circles, etc. 
In Condition 2, Color, the four colors were 
segregated, but the shapes were scattered 
randomly over the field. In Condition 3; 
Shape, the four shapes were segregated, but 
the colors were scattered randomly over the 
field. In Condition 4, Ambiguous, both the 
four colors and the four shapes were 
segregated, but differently, so that the four 
circles might be half blue and half red, and of 
the three triangles, two might be red, and 
one violet, etc. In Condition 5, Random, 
both color and shape were scattered inde- 
pendently and randomly over the field. 


These were 


Hypothesis. If rapid counting in the col- 
lege student depends upon grouping the field 
of objects, then Condition 1, having a strong 
perceptual organization based on color and 
shape, should be counted most rapidly. Con- 
ditions 2 and 3, having color and shape ar- 
rangements, respectively, should provide esti- 
mates of the relative grouping value of the 
two dimensions. Condition 4 gives S two 
conflicting modes of organization. If one 
mode is much stronger than the other, the 
condition should be about as fast as the con- 
dition employing only the strong variable. 
Nearly equal variables might lead to insta- 
bility of grouping, perhaps forcing S to start 
Over in his counting. Condition 5, having 
no principle of grouping available, should 
lead to counting rates close to those observed 


in Experiment Il, scrambled arrangements 
for college students, 


Results 


Time to count was a linear function 
of total number of Objects. The results 
here are very close to those obtained 
with college students for the Scram- 
bled condition in Experiment IL, This 
indicates that the general findings are 
Probably quite stable With successive 
samples from this pool of Ss. 

As Predicted, the Color and Shape 
condition is counted fastest. Condition 
2, Color, is almost as fast, and Condi- 
tion 3, Shape, is slower, so it appears 
that color is a much more effective 
dimension than shape in this experi- 
ment. This agrees with many experi- 
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mental studies comparing the two 
variables, varied approximately as in 
this experiment. 

In these data, shape segregation has 
little advantage over Random, Condi- 
tion 5. Therefore, it is predictable 
that the Ambiguous condition, having 
one strong (color) and one very weak 
(shape) organizing variable in compe- 
tition, would be almost as fast as color 
alone. Notice that it is faster than 
Shape (Condition 3) or Random 
(Condition 5), though a small effect 
of shape is found in the fact that Con- 
dition 4 is slower than Conditions 
1 or 2. 

An analysis of variance showed that 
both the number of objects and the 
five conditions produce significant (p 
< 01) effects on speed of counting. 
These results establish that rapid 
counting depends upon grouping the 
material into subgroups. It is proba- 
ble that Ss segregated the whole dis- 
play into color groups, subitized the 
number in each group, and then added 
these numbers to obtain the result. 
When no such grouping was possible, 
S was forced to rely on a relatively 
difficult and ambiguous grouping by 
position, or to count one by one. 


Discussion 


The experiments reported above 
show that in counting, the set of ob- 
Jects is grouped perceptually, in accord 
with Wertheimer's rules of propin- 
quity, good continuation, and similar- 
ity. This grouping then serves as a 
support or basis for counting, in that 
S performs his counting operation 
within groups and then in some fashion 
connects the various partial counts. 
One hypothesis is that college students 
subitize within each group, that is, 
determine the number by a somewhat 
mysterious but very rapid and accurate 
"perceptual" method. The S might 


4 


collect such numbers and add them at 
the end or might subitize each group 
and add the result to a running total. 

The fact that fairly young children 
show great sensitivity to the organiza- 
tion of the visual field, as shown in 
Experiment I, suggests that grouping 
may play a more general role in count- 
ing. That is, even when a child is 
enumerating one by one, he may work 
rapidly and routinely within one group, 
pause and consolidate or "store" his 
result in some way, and then attack 
the next group. The pausing, and the 
ability to divide the task into suitable 
parts, is possibly a generally important 
part of a long serial task. 
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COGNITIVE AND DRIVE FACTORS IN THE 


EXTINCTION OF THE CONDITIONED 
EYE BLINK IN HUMAN SUBJECTS* 


KENNETH W. SPENCE 
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Extinction of the eyelid conditioned response (CR) in humans has 
been shown to be a function of factors relating to Ss' capabilities to 
discriminate the change in stimulus conditions at the inception of 
extinction. Hypothesizing that the rapid extinction obtained with 
humans in the standard conditioning situation results from an in- 
hibitory set not to respond based on this discrimination, subsequent 
experiments in a masking situation designed to control (eliminate) this 
cognitive factor revealed that extinction of the eyelid response in 
humans proceeds at a relatively slow rate, as in animals. The findings 
with the masking situation also suggest that the partial reinforcement 
effect is absent in humans as in the case of classical aversive extinc- 


tion with animals. 


The empirical construct type of the- 
ory espoused by Hull (cf. 1943, 1952) 
and Spence (1956, 1958, 1960) takes 
its start from the findings of laboratory 
studies of classical and instrumental 
conditioning. That is, in this type of 
theorizing the specification of the quan- 
ütative properties of the theoretical 
concepts is based on empirical laws 
found in these simple kinds of experi- 
ments. Progress in the development 
of this theory has been most rapid in 
the areas of acquisition and motivation. 
On the basis of laws relating various 
response measures to such experimen- 
tal variables as the number of rein- 
forcements, magnitude of the reinforc- 
ing event, and deprivation operations, 
theoretical constructs such as associa- 
tive or habit strength (H), incentive 
motivation (K), and drive level (D) 
have been introduced. 


THEORETICAL INTERPRETATIONS OF 
EFFECTS OF NONREINFORCEMENT: 
CONCEPTS OF INHIBITION 


Much less satisfactory is the portion 
of the theory that deals with the role 
of the nonreinforcement variable in de- 


termining response strength. In his 
Principles of Behavior Hull (1943) 
introduced the constructs of work in- 
hibition (Jz) and conditioned inhibi- 
tion (SR) to handle this area of 
behavior. While these constructs might 
have some validity in relation to the 
phenomenon of response effort, they 
have proved entirely inadequate as far 
as the behavioral phenomena resulting 
from nonreinforcement are concerned. 

Actually, there was relatively little 
data on extinction phenomena availa- 
ble to Hull at the time he formulated 
his theory. But what little there was 
had led the present author to a very 
different conception from the one Hull 
(1943) proposed. As early as 1936, 
in a theoretical article on discrimina- 
tion learning, the present author in- 
terpreted nonreinforcement of an ap- 
petitionally motivated response as in- 
volving frustration (Spence, 1936). 
While time limitations did not permit 
a discussion of extinction phenomena 
based on collegene talks 
i of universities. Prepara- 
ER A was supported by Con- 


5 { Office of 
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in the Silliman Lectures (Spence, 
1956), in connection with the treat- 
ment of delay of reinforcement, it was 
pointed out that the two experimental 
operations, delay of reinforcement and 
nonreinforcement, were highly similar 
and presumably produced the same 
kind of behavioral effects. dt was 
further suggested that underlying these 
effects were frustration-aroused com- 
peting responses. Since these Lec- 
tures, a number of my former students 
and I have further elaborated this 
frustration. theory of inhibition, par- 
ticularly in relation to a fairly sizable 
body of quantitative phenomena that 
have been shown to occur in instru- 
mental reward conditioning and ex- 
tinction under different schedules of re- 
inforcement (cf. Amsel, 1958 ; Spence, 
1960). 

Contrasting sharply with these re- 
cent developments in the instrumental 
reward type of conditioning, there has 
been little or no advance in our knowl- 
edge concerning experimental extinc- 
tion in classical reward conditioning 
of the Pavlovian type. Unfortunately, 
the Russians did not go in for the kind 
of parametric studies that provide the 
type of quantitative laws that are 
needed for such theorizing as ours. 
As a consequence, it is not known to 
what extent the relevant variables and 
laws of extinction in classical reward 
conditioning are similar to or different 
from those discovered in the instru- 
mental variety. Needless to say, this 
deficiency precludes the possibility of 
ascertaining whether our theory, as 
inferred from instrumental data, is suf- 
ficiently general to encompass classical 
phenomena. It is the author's own 
expectation that the laws of extinction 
of the classically rewarded response 
will be sufficiently different from those 
of instrumental reward learning as to 
require a somewhat different theoreti- 
cal formulation. For example, the 
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author does not believe that frustra- 
tion-aroused, incompatible responses 
play the kind of role in the extinction 
of the classical response that has been 
conceived as occurring in the extinc- 
tion of the instrumental response. But 
until more precise empirical data have 
been obtained concerning the effects 
of such variables as the magnitude and 
schedule of the original reinforcing 
stimulus on the subsequent extinction 
rate of the classical salivary response, 
such speculation is not very fruitful. 
There simply are too many alterna- 
tive theoretical possibilities to choose 
among, and, since the experiments 
have to be conducted if the laws are 
to be obtained, we had best get to the 
job of doing them. 

What has just been said about clas- 
sical reward conditioning also holds 
for aversive types of conditioning. 
Certainly failure of occurrence of 
the noxious unconditioned stimulus 
(UCS) such as occurs in the extinc- 
tion of a classical defense response can 
hardly be conceived as frustrating to 
the organism. Obviously, some other 
theoretical interpretation must also be 
found for this type of situation, other, 
that is, than that developed for in- 
Sstrumental reward conditioning. But 
again, it should be emphasized that 
the pressing need is for empirical stud- 
ies that will provide the laws describing 
the course of experimental extinction 
of such responses and the variables 
that influence it. The remainder of 
this paper will be concerned with the 
findings of such studies, most of which 
Were conducted at the University of 


Towa under contract with the Office 
of Naval Research, 


PROBLEM oF HIGHER PSYCHOLOGICAL 
PROCESSES IN Human Cox- 
DITIONING STUDIES 
The studies of cla 


ene ssical defense con- 
ditioning conducted 


in our laboratory 
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have involved the use of human sub- 
jects (college students), and the re- 
sponse conditioned has been the eye 
blink or corneal reflex as elicited by 
a puff of air directed at the eye. À 
major problem that arises in such re- 
search is that of controlling, with a 
view toward eliminating, the influence 
of higher psychological (symbolic) 
processes on these simple learned re- 
sponses. To the extent that such 
higher activities do occur in these ex- 
periments the laws obtained are a 
function of unknown and unwanted 
variables. Certainly, such uncontrolled 
experiments would not provide the 
laws of simple learning that are being 
sought and thus would be of little value 
for the present theoretical endeavors. 


Acquisition Phase 


The author’s evaluation of these 
studies is that, at least as far as the 
acquisition phase of conditioning ex- 
periments is concerned, the data (i.e., 
laws) they have provided are fairly 
adequate—in the sense that they are 
more or less free from the influence of 
these complicating factors. However, 
the same cannot be said for the data 
on the extinction phase. Indeed, it is 
strongly suspected that complex proc- 
esses, call them cognitive factors, sets, 
mediating processes, or what you will, 
have greatly affected the extinction 
data heretofore reported for human 
Subjects. If such is the case, then 
Obviously these data do not provide 
a satisfactory basis on which to con- 
struct that part of the present theory 
concerned with the effects of non- 
reinforcement. 

There is not sufficient time to docu- 
ment the basis of the author's optimism 
concerning acquisition data. It is 
enough to note that, on the whole, 
quite consistent data on acquisition are 
obtained from one study to another 
and across different laboratories. But 
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more important, the laws relating the 
conditioning performance of humans 
to the several variables that have been 
investigated appear to have the same 
form as those obtained with animals. 
The one major difference—one, inci- 
dentally, that does not seem to be too 
well known—is that classical condition- 
ing occurs with much greater rapidity 
in humans than animals. While this 
finding suggests different rate-of-con- 
ditioning parameters for different spe- 
cies, it is also possible that it merely 
reflects differences in the motivational 
level of different species under the 
same intensity of the UCS. For ex- 
ample, the extremely slow conditioning 
of the eyelid response in the rabbit 
may be due to a lesser sensitivity of 
the rabbit’s cornea. It is known 
that the rate of conditioning in human 
subjects does vary with the intensity 
of the UCS. More research is needed 
with animal subjects under different 
UCS intensities before a correct inter- 
pretation can be made. The author 
would merely like to add that as a 
consequence of the early studies of 
Hilgard and his students on the effects 
of different kinds of verbal instruc- 
tions, it appears that it is known 
how to produce “sets” in human sub- 
jects which minimize the influence of 
these cognitive factors, at least during 
the acquisition phase of conditioning 
(cf. Hilgard, Campbell, & Sears, 1938; 
Grant, 1939). 


Extinction Phase 


to the extinction data 


Turning now a 
reported heretofore, at least two major 
the findings ob- 


discrepancies between te, ; 
mal subjects 


tained with human and ani 


are found, both of which led the author 
tinction perform- 


to suspect that the ex f 
ance of the humans was greatly influ- 
enced by cognitive factors not present 


in animals. The most striking of these 
differences and the one with which 


448 KENNETH 
this paper will be primarily concerned 
is that in animals the extinction rate is 
about the same as their rate of con- 
ditioning, whereas in humans extinc- 
tion occurs much more rapidly than 
acquisition. Indeed, as will be seen, 
under the condition of discontinuing 
he UCS, extinction occurs in most 
human subjects in one or two trials. 
A number of studies have shown 
that extinction of the conditioned eye- 
id response is relatively slow in ani- 
mals. Thus, Hilgard and Marquis 
(1936) found that of four monkeys 
who had their eyelids conditioned to a 
flash of light, three revealed no per- 
ormance decrement whatever jn 180 
nonreinforced trials, while the fourth 
decreased to a level about 50% of 
the final acquisition strength. Subse- 
quently, one of the monkeys was sub- 
jected to 720 extinction trials, at the 
end of which it was still responding 
at a 60% conditioned-response (CR) 
level (Hilgard & Marquis, 1936). A 
similar finding was obtained by Hil- 
gard and Marquis (1935) in the case 
of eyelid conditioning in dogs, with 
extinction Proceeding very slowly. 
More recently experiments with 
rabbits (Gormezano, Schneiderman, 
Deaux, & Fuentes, 1962; Schneider- 
man, Fuentes, & Gormezano, 1962: 
Thomas & Wagner, 1964) have 


also 
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trials reported by Humphreys (1939) and 
Grant and Schipper (1952). 
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revealed that extinction in these ani- 
mals is a relatively prolonged process. 
For example, in a study in which 
extension of the nictitating membrane 
of the eye of the rabbit was conditioned 
to a tone, it was found that extinction 
took longer (560 trials) than was re- 
quired to reach the acquisition asymp- 
tote (420 trials). Similarly, Bitterman 
and his colleagues have found that ex- 
tinction of a classical aversive CR in 
the pigeon and the goldfish was slower 
than the original acquisition of the 
CR (Gonzalez, Longo, & Bitterman, 
1961; Longo, Milstein, & Bitterman, 
1962). 

In sharp contrast to these extinction 
findings obtained with animals are the 
data on extinction of the eyelid CR 
in humans. Figure 1 presents some 
tvpical data from two early inves- 
tigations of the conditioned eye blink 
in humans (Grant & Schipper, 1952; 
Humphreys, 1939). In both of these 
studies the extinction procedure, as in 
the case of all of the animal studies, 
involved discontinuance of the UCS. 
The most significant thing to note 
about these curves is the rapidity with 
which extinction takes place. Less 
than 12 trials were required to reach 
a random level of responding in the 
Humphreys’ (1939) study, while the 
curve of Grant and Schipper (1952) 
had reached this level by Trials 11-15. 
Not only do these graphs reveal ex- 
tinction of the conditioned lid closure 
to be much faster in humans than ani- 
mals, but also the ratio of extinction 
trials to acquisition trials is much 
smaller than in animals. Thus in 
these two studies the number of trials 
to extinction, 12-15, was only about 
20-40% of the number of trials re- 
quired to attain the 


tote, a finding quite typical of the data 
available in the literature, In animals, 
on the other hand, as many or even 


acquisition asymp- 
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more trials are required to extinguish 
a response as were involved in the 
original conditioning. 

But, as is now known from recent 
studies conducted in our laboratory, 
discontinuance of the air puff leads to 
even more rapid extinction in humans 
than was revealed by the data of these 
earlier experiments. The manner in 
which the curves in these studies were 
drawn actually masked the abruptness 
of the response decrement. The rea- 
sons for this are that (a) too large 
blocks of trials were employed in draw- 
ing the curves; and (b) the data for 
the first extinction trial was included 
in the first block of trials. Obviously, 
a response on the first extinction trial, 
occurring as it does before the first 
nonreinforcement, does not reflect the 
effects of this operation. The more 
appropriate procedure is to plot the 
data in terms of the number of pre- 
vious nonreinforced trials. Moreover, 
graphing the curves in terms of single 
trials or blocks of two trials provides 
a more precise picture of the early 
course of the response decrement. 

Figure 2 presents a recently obtained 
curve of extinction for human subjects 
drawn in this manner (Spence, Rut- 
ledge, & Talbott, 1963). Under this 
procedure the data for the first point 
(0) on the graph, which is the first 
trial on which no reinforcement is 
given, reflects performance after the 
last reinforced trial, while the data for 
Trials 1-2, 3-4, etc., represent the 
average frequency of response to the 
conditioned stimulus (CS) after the 
indicated number of nonreinforced 
trials. As may readily be seen, after 
only 2-4 extinction trials, the fre- 
quency of the CR has dropped to a 
random level. 

This and other recent studies from 
our laboratory have clearly revealed 
that the rate with which the condi- 
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Fic. 2. Percentage of CRs during ex- 
tinction as a function of number of previous 
nonreinforced trials in which UCS was 
omitted (Spence et al., 1963). 


tioned eye blink in humans decreases 
in strength under the extinction pro- 
cedure of omitting the UCS is ex- 
tremely rapid, not only as compared 
with animals, but also as compared 
with the rate of acquisition of the re- 
sponse in humans. Whereas from 50- 
80 trials are typically needed to con- 
dition the eyelid, extinction requires 
very few. Indeed, in the case of the 
above experiment more than half the 
subjects never gave a single CR fol- 
lowing the first nonreinforced trial. 


Drive LEVEL AND EXTINCTION 
or Eyerip CR 


How is this striking difference in 
the extinction behavior of animals and 
man to be interpreted? One possibil- 
ity, of course, is to assume that the 


quantitative properties of the effects 


of these two experimental operations, 


reinforcement and nonreinforcement, 
are very different in humans and suns 
ply assign appropriate parameter val- 
ues to the rates of their development. 
But only the most naive of theorists 


would proceed on such an assumption 
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without considering the alternative 
possibility that the extreme rapidity 
of this extinction in humans reflects 
the operation of factors other than the 
development of a simple learning (non- 
reinforcement) effect. . 
Another possibility is that some kind 
of higher psychological process takes 
over control of the behavior during 
extinction in the case of the human 
subject. A number of early experi- 
ments (cf. Grant, 1939; Miller, 1939) 
had shown that instruction-produced 
sets can markedly facilitate or inhibit 
the level of response in such condition- 
ing experiments. It seems entirely 
possible that self-instructions, of which 
the subject might or might not be 
aware, could play a decisive role in 
determining this extinction perform- 
ance. The first idea that occurred was 
that the human subject readily recog- 
nizes the discontinuance of the noxious 
UCS, ceases to expect it, and thus re- 
duces his level of emotional response 
(fear) and hence his drive level (D. 
The rapid extinction in humans could 
thus be accounted for in terms of a 
sudden drop in the level of D. 
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McAllister Experiment 


An experiment that appeared to lend 
some suport to such a notion had been 
run some years earlier in our labora- 
tory by McAllister (1953). After as- 
certaining that little or no eyelid con- 
ditioning occurs in human subjects 
when a CS-UCS interval of 2.5 sec- 
onds was employed, McAllister then 
demonstrated that CRs established at 
an optimal CS-UCS interval (.45 sec- 
ond) extinguished when the interval 
was extended to 2.5 seconds. Extinc- 
tion of subjects under this latter 
method provides a test of the present 
drive hypothesis, for under this condi- 
tion the UCS is continued, and thus 
the drive level of the subject is main- 
tained throughout the extinction pe- 
riod. Accordingly, it would be ex- 
pected that the performance curve 
would drop much more slowly under 
this condition than when the UCS is 
discontinued altogether, 

As shown in Figure 3, McAllister’s 
results confirmed this expected differ- 
ence. The lower curve represents the 
performance of subjects extinguished 
without the UCS. The solid upper 
curve portrays the performance of sub- 
jects that had the delayed UCS. Both 
curves give the percentage of CRs that 
occurred in the interval 100-450 milli- 
seconds following the onset of the CS 
for successive blocks of 10 trials. It 
is clearly evident that the decrement 
in performance developed much more 
slowly in those subjects that had the 
UCS present but delayed. In line 
with the above reasoning, McAllister 
interpreted this greater resistance to 
extinction as being due to the main- 
tenance of the level of D when the 
delayed UCS was employed, whereas 
the level of D decreased when the UCS 
was discontinued. While this interpre- 


tation in terms of different levels of D 


nicely accounts for the performance 
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difference between these two groups, 
it does not, it should be noted, explain 
why the extinction in human subjects 
Is so precipitous, so rapid relative to 
that in animals which also had the 
UCS discontinued. 


Reynolds Experiment 


, The inadequacy of this interpretation 
in terms of a difference in level of D 
only was further revealed by the find- 
Ings of a subsequent study conducted 
for a doctoral dissertation by Reynolds 
(1958). While one part of Reynolds' 
study supported the findings of McAI- 
lister, another portion did not. Mc- 
Allister's groups had, for another pur- 
pose, been conditioned with a partial 
reinforcement schedule (80%). Con- 
firming McAllister, Reynolds found 
that when the original conditioning 
involved a partial reinforcement sched- 
ule (60%), the group with the delayed 
S again extinguished much more 
slowly than the group in which UCS 
was discontinued. This can be seen 
in Figure 4 in which the extinction 
curves for the partial groups (open 
circles) start at the 50% level. 

In the case of subjects who acquired 
the CR under 100% reinforcement, 
however, Reynolds failed to find any 
difference whatever in extinction per- 
formance. As may be seen, the de- 
layed CS-UCS interval group showed 
the same precipitous decrease in re- 
sponse strength as did the group that 
had no UCS in extinction. Note 
should be taken of the fact that the 
curves in this graph are of the earlier 
type that included the first nonrein- 
forced trial in the first block of 10 
trials. Omitting this trial, the level 
of response for the two groups on 
Trials 2-10 fell slightly below 20%. 
This finding with 100% reinforcement 
has since been replicated at two differ- 
ent levels of original conditioning in 
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Fic. 4. Showing effects on extinction in 
standard situation of partial (P) and con- 
tinuous (C) reinforced training trials and of 
delayed UCS (1) and omitted UCS (2) 
during extinction. (From Reynolds, 1958.) 


an experiment in which the ready sig- 
nal was not employed (Spence et al., 
1963). In both instances there was 
no difference in extinction rate be- 
tween groups that had the UCS 
omitted and groups that had the UCS 
present but delayed during extinction. 


DISCRIMINATION HypotHeEsis: CoN- 
CEPT OF ĪNHIBITORY SET 


These findings clearly indicated that 
this simple drive interpretation was 
not adequate, and that it was necessary 
to appeal to some factor other than 
reduced drive level to account for both 
the facts of no difference in extinction 
rate and the extreme rapidity with 
which the CR extinguished. The clue 
followed in the attempt to ascertain 
what this factor might be was obtained 
by questioning subjects following ex- 
tinction. In the Spence et al. (1963) 
study and in Goldstein’s (1962) ex- 
ploratory experiment it was found that 
practically all subjects reported that 
they recognized the change in condi- 
tions that occurred with the shift from 
acquisition to extinction. On the basis 
of this information it was hypothesized 
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that human subjects, on noticing the 
change in stimulus conditions, become 
aware that their eyelid has been con- 
ditioned and thereupon more or less 
immediately adopt a set not to blink 
to the CS. Such an inhibitory set, it 
should be noted, would account for the 
sudden disappearance of the CR fol- 
lowing consistent reinforcement. 

Under partial reinforcement condi- 
tioning, on the other hand, the subject 
has experienced nonreinforcements on 
some portion of the acquisition trials 
(e.g. 50%). Hence the shift to ex- 
tinction (i.e., 50%-0%) should not be 
as noticeable as the shift from 100% 
to no reinforcement, Presumably rec- 
ognition of the shift to zero reinforce- 
ment in this circumstance would be 
delayed and possibly in some subjects 
would not occur at all. In this event 
the postulated inhibitory set would not 
be present, at least not during the early 
Stages of extinction, Furthermore, 
with its absence the CRs would be able 
to occur and thus could reflect any 
differences in the drive level of the 
subjects that might be present. Ac- 
cordingly, it would be expected that 
subjects in whom the UCS is discon- 
tinued would show a more rapid decre- 
ment than do subjects in whom the 
UCS is continued, but delayed. This 
predicted result is, it will be noted, 
in accord with the findings obtained 
with the partially reinforced groups of 
McAllister and Reynolds. 

Finally, this hypothesis predicts the 
well-established finding that extinction 
of classical CR in humans is not nearly 
as rapid following partial reinforce- 
ment as after continuous. This inter- 
pretation of the difference in extinction 
performance following these different 
reinforcement schedules is, as will be 
recognized, a type of discrimination 
hypothesis. The present version, how- 
ever, differs from previous ones in its 
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introduction of a more specific hypo- 
thetical mechanism, the inhibitory set. 


INTRODUCTION Or THE MASKING TASK 


Our most recent investigations have 
been primarily directed at testing this 
theoretical interpretation of extinction 
by attempting to arrange experimental 
conditions that would reduce to Zero, 
if possible, the discriminability of the 
change in conditions with shift from 
acquisition to extinction. To the ex- 
tent that this objective could be 
achieved, the role of the hypothetical 
inhibitory set during extinction would 
be minimized. Presumably, if no dis- 
crimination occurred the set would not 
be operative, and extinction of the CR 
would be a function of simpler non- 
reinforcement effects (In), as in ani- 
mals. 

The effects of a number of changes 
have been investigated (Spence, Hom- 
zie, & Rutledge, 1964), the major one 
of which involved the use of a mask- 
ing learning situation in terms of which 
the purpose of the experiment was 
Specified. But first, the reasons for 
introducing this change: Questioning 
of the subjects following the comple- 
tion of extinction in a recent study 
(Spence et al, 1963) revealed that 
52% were able to recognize that the 
experiment was concerned with the 
conditioning of the eye-blink response. 
This percentage is to be compared with 
the findings of earlier studies that had 
involved Conditioning, but not extinc- 
tion. In such studies the percentage 
of subjects who were able to identify 
the experiments as involving condi- 
tioning had ranged around 5-10%. It 
75 apparent from this that the added 
experience of extinction greatly in- 


gnition by the subject 
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postulated might play a critical role in 
determining performance during ex- 
tinction. 

A technique was sought, then, that 
would preclude the subject’s recogniz- 
ing that he was participating in a 
conditioning experiment even when 
extinction was attempted. It was also 
hoped that the procedure would de- 
crease still further the likelihood that 
the changes in the stimulus events rela- 
tive to extinction would be observed. 
The technique hit upon was that of 
embedding the conditioning experi- 
ment within the setting of another 
a situation (Spence et al., 

The learning task used for this pur- 
pose was the light-guessing, or proba- 
bility-learning, task designed by Estes 
and Straughan (1954). This situation 
involved a centrally placed signal light 
Which, when it came on, was the signal 
for the subject to anticipate within its 
duration (2 seconds) which of two 
small bulbs, one to the left, one to the 
right of the signal light, would subse- 
quently light. The subjects were given 
à set of instructions to the effect that 
the experiment was concerned with the 
effects of distraction on performance 
In a difficult problem-solving situation. 

€y were instructed that their task 
Was to predict which of the two small 
lamps would light and to signify their 
Prediction by pressing the push button 
located on the left or right arm of their 
Chair, They were told further that dis- 
tracting stimuli in the form of a tone 
and air puff to their eye would be 
Siven between their response of press- 
ing a button and the lighting of one 
of the lamps. The onset of the tone 
coincided with the turning off of the 
signal light. The subjects were urged 
to predict to the best of their ability 
and to attempt to improve their pre- 
diction with practice. The small lamps 
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Fic. 5. Comparison of extinction rate in 
standard (Group I) and masking situations 
(Group II). (From Spence et al., 1964.) 


lit according to a prearranged schedule. 
For half the subjects, the left lamp 
lit 7 times in a block of 10 trials, the 
right lamp 3 times. In the other half, 
these frequencies were reversed. 


EXPERIMENTS WITH THE 
MASKING TASK 


The main finding of our first experi- 
ment employing this masking situation 
is shown in Figure 5. This graph 
compares the extinction of subjects 
(Group I) conditioned and extin- 
guished with a delayed UCS in the 
standard conditioning situation with 
that of a group (II) in which the con- 
ditioning and extinction were em- 
bedded in the masking situation. As 
may be seen, the slopes of the two 
extinction curves differ markedly, and 
it would appear that the CR extin- 
guishes much more slowly in the case 
of subjects run in the masking situa- 
ton. Whatever its basis, the masking 
situation produced a most dramatic 
change in the behavior of human sub- 
jects. That it is related in some man- 
ner to the recognition on the part of 
the subjects of the change in stimulus 
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conditions with shift to extinction is 
suggested by the results of a question- 
naire given at the end of the experi- 
mental period. Thus 19 (76%) of 
the 25 subjects in Group I reported 
recognition of the shift in conditions, 
while only 1 (476) of the subjects run 
in the masking situation was able to 
report the change. 

A second study (Goldstein, 1962) 
was especially concerned with whether 
subjects, following 10096 reinforce- 
ment in conditioning, would exhibit 
different extinction performances un- 
der the two procedures of omitting and 
delaying the UCS. It will be recalled 
that no difference was obtained be- 
tween groups extinguished by these 
two methods under the standard con- 
ditions in the Reynolds (1958) study 
and Spence et al.’s (1963) replication 
of it. Moreover, it will be remembered 
that extinction was extremely rapid in 
both experiments. If, as assumed, the 
inhibitory set is not operative in the 
masking situation, a difference should 
now be obtained with these two ex- 
tinction procedures involving as they 
do differences in the drive levels of 
the subjects? 

The results were in complete agree- 
ment with our assumption. As Figure 
6 reveals, the group (C-1) that had 


? Attention should be called to the fact that 
a difference was obtained in the standard 
conditioning situation between these two 
extinction procedures in a study (Spence 
et al, 1964) in which the duration of the 
CS was not changed between the acquisition 
and extinction periods as had been the case 
in the earlier studies. „The difference was 
ascribed to the lesser discriminability of the 
shift in conditions to extinction. Complicat- 
ing these results, however, are the findings 
obtained by Price, Abbott, and Vandament 
(1965) of a difference under conditions 
similar to those of Reynolds (1958). It is 
possible that the relatively slow extinction of 
the delayed-UCS groups in the Price et al, 
study resulted from unidentified voluntary- 
form responders. 
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the UCS present in extinction per- 
formed at a much higher level than the 
group (C-2) that had the UCS discon- 
tinued. The difference was highly 
significant (p < .01). The remaining 
two curves are for groups extin- 
guished following partial reinforcement 
(60%). As in the McAllister (1953) 
and Reynolds (1958) studies, the 
group (P-1) that had the delayed 
UCS responded at a considerably 
higher level during extinction than 
Group P-2 in which the UCS was 
omitted. 

The use of a more extended extinc- 
tion period (100 trials) in the Gold- 
stein (1962) study revealed, however, 
à new complication that had not been 
observed in previous data obtained 
with the delayed-UCS procedure. In 
the standard conditioning situation this 
method of extinction had always led 
to complete extinction of the response, 
that is, to the level of spontaneous 
blinking. As Figure 6 shows, how- 
ever, the curves of groups extinguished 
with the delayed-UCS technique ap- 
pear to be asymptotic at a much higher 
level (35-40%). A subsequent study 
(Spence & Rutledge, 1964) has shown 
that extinction curves obtained with 
this technique in the masking situation 
do not drop below this level (approxi- 
mately 40%), even when 150 extinc- 
tion trials were given. Examination 
of individual records indicated that this 
failure of the curves to drop to a 
random level of blinking was a conse- 
quence of the fact that approximately 
one-third of the subjects showed no 
response decrement, and some even 
showed a slight increase, 


This finding suggests that the de- 
layed-UCS technique is reinforcing for 
Some subjects and hence is not a 
satisfactory procedure for maintaining 
drive level during extinction. The fact 
that it had not appeared in previous 
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Fic. 6. Showing effect on extinction in 
masking situation of partial (P) and con- 
pem (C) reinforced training trials and 
o delayed UCS (1) and omitted UCS (2) 
uring extinction. (From Goldstein, 1962.) 


studies using the standard situation can 
ia be interpreted as resulting 
Tom the operation of the inhibitory 
s» of the subject not to respond in 

IS Situation. With no inhibitory set 
Present, as assumed, in the masking 
Situation, the reinforcing effects of the 
delayed UCS would have an oppor- 
tunity to be revealed. 

But regardless of the explanation of 
this phenomena, it necessitated the 
finding of some other method than the 
delayed-UCS technique of maintaining 
the drive level of the subject during 
extinction. The procedure subse- 
quently developed involved presenting 
the CS alone during extinction on a 
tandom half of the probability-learning 
(guessing-game) trials and the UCS 
alone on the other half. Thus the CS 
Was never paired with the UCS, but 
the drive-maintaining puff was present 
during extinction. In order to main- 
tain the average intertrial interval be- 
tween successive presentations of the 
UCS in the original conditioning the 
same as during extinction, the paired 
CS and UCS were also presented on 


455 


only a random half of the guessing- 
game trials during acquisition. On 
the remaining half of these trials 
neither the CS nor UCS occurred. 
This procedure thus gave a subject ex- 
perience during acquisition with guess- 
ing-game trials in which no UCS was 
present and hence made the shift to 
extinction less discriminable. 

The findings of an experiment 
(Spence, 1966) in which this new 
extinction technique was employed are 
shown in Figure 7. In addition to 
the subjects (Group I) extinguished 
in the manner just described two other 
groups were run. In Group II the CS 
was presented alone on half of the 
guessing trials with neither the CS 
nor the UCS being present on the 
other half. Since the UCS was not 
present in extinction, the drive level 
of Group II would, theoretically, be 
lower, and hence its performance 
should be lower than that of Group I. 
As the graph shows, this result was 
obtained (p < .01). 

In the case of the third group the 
delayed-UCS technique was employed, 
thus providing an opportunity to com- 
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Fic. 7. Extinction curves obtained in 
masking situation with three different pro- 
cedures: Group J—nonpaired UCS present, 
Group II—no UCS present, Group III—de- 
layed paired UCS present during extinction. 
(From Spence, 1966.) 


456 


pare this previously used method of 
maintaining drive during extinction 
with the new procedure. In this group 
S, and S, were presented together at 
the long interval (2,500 milliseconds ) 
on a random half of the guessing trials. 
In the other half of these trials neither 
stimulus was presented. Referring 
again to Figure 7, it is evident that 
the curve of Group III leveled off well 
above the random blink level of 18% 
response. On the other hand, the curve 
for the group extinguished with the 
new technique (Group I) dropped sig- 
nificantly below that of Group III (5 
< .025), reaching a level of about 
27%. A stil more recent study 
(Homzie, 1964) indicates that about 
80 extinction trials are required to 
bring the curve obtained under this 
condition down to the spontaneous 
blinking rate. Since only 50 trials 
were needed in acquisition to attain 
asymptotic performance, it is apparent 
that, when cognitive factors are not 
operative and drive level is main- 
tained, extinction of the eyelid CR in 
humans as in animals is somewhat 
slower than acquisition, 


EXTINCTION AS A FUNCTION oF 
REINFORCEMENT SCHEDULES 


The statement was made earlier that 
there were two major discrepancies 
between the extinction findings ob- 
tained with human and animal subjects 
in classical aversive conditioning. The 
second difference is that, whereas ex- 
tinction in humans has heretofore con- 
sistently been found to be slower fol- 
lowing a partial reinforcement schedule 
than after continuous reinforcement, 
the findings with animals reveal no 
such difference. For example, Thomas 
and Wagner (1964) have reported 
finding that extinction of the eye-blink 
response in the rabbit proceeded at the 
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same rate following 50% and 100% 
reinforcement schedules in the original 
conditioning. Likewise, studies by 
Bitterman and his associates (Gonzales 
et al., 1961; Longo et al., 1962) have 
also failed to show a significant partial 
reinforcement effect in the extinction 
of classical defense responses in fish 
and pigeons. Finally it is of interest 
to note that Ross, Koski, and Yaeger 
(1964) have recently reported a study 
using human retardates (mean IQ, 
about 19) in which the partially rein- 
forced subjects extinguished at the 
Same rate as those that had received 
continuous reinforcement, It is appar- 
ent from these findings that the partial 
reinforcement effect in classical aver- 
sive conditioning is not present in or- 
ganisms lacking effective higher psy- 
chological processes. . 
In terms of the present theoretical 
analysis the slower extinction following 
partial as compared to continuous rein- 
forcement found heretofore in studies 
with humans is a result of the rela- 
tively slower and poorer discrimination 
of the shift to extinction on the part 
of the partial subjects. In the new 
masking situation this difference in 
discrimination should be minimal if 
not altogether absent; hence the partial 
reinforcement effect should not be evi- 
dent. Examination of the two lowest 
curves in Goldstein's data (Figure 6) 
reveals that, in accord with this pre- 
diction, there is no difference in the 
curves of the partial and continuous 
groups (C-2, P-2) in which the UCS 
was omitted. In the case of the sub- 
jects that had the delayed UCS (C-1, 
P-1), the results a 


re less clear-cut. 
In view 


of the possible reinforcing 
Properties of this procedure, further 
research on this problem should em- 
ploy the new extinction technique de- 
veloped in connection 


task (Spence, 1966). 


with the masking 
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PARTIAL REINFORCEMENT: A HYPOTHESIS OF 


SEQUENTIAL EFFECTS' 


E. J. CAPALDI 
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A hypothesis which attempts to account for extinction following par- 
tial reinforcement in terms of certain sequential aspects of nonrein- 
forced and reinforced trials was considered. The central assumptions 
of the hypothesis are that nonreinforced trials occasion a specific and 
distinctive internal stimulus and that this stimulus is progressively 
modified by successive nonreinforcements. Stimulus modification was 
indicated as exerting its effects on extinction through 3 sequential 
variables: N-length, number of different N-lengths, and number of 
occurrences of each N-length. The operation of these variables and 
the stimulus modification mechanism was elaborated by considering 
certain salient and representative partial reinforcement investigations. 
This detailed survey of critical experimental findings suggested that 
the better known nonsequential variables which form the basis of most 
current partial reinforcement hypotheses exercise little or no control 
over extinction and that a theory of extinction must be based on a 
sequential analysis. Experimental evidence was considered which sug- 
gests that the modification mechanism may be applicable to all partial 


reinforcement situations, whether classical or instrumental, appetitive 


or aversive. The suggestion was a nt h 
to deliver certain types of rate-of-extinction 


These particular rate predictions, while relatively primi- 
be something of an improvement over 


han type prediction. 


pothesis may be able 


predictions. 
tive, do appear nevertheless to 


the current greater-t! 


Reinforcement (R) and nonrein- 
forcement (N) or, more precisely, the 
organism’s reactions to these, appear 
to occasion potent internal stimuli 
which regulate behavior in a variety 
of learning situations. The exclusive 
focus here, however, will be upon the 
role of these stimuli in governing par- 
tial reinforcement (PR) effects. Cen- 
tral to the present treatment is the 
assumption that the isolation of lawful 
relationships in PR is dependent upon 
a detailed and exhaustive examination 
of certain aspects of the sequence of N 
and R trials, a view which current 
major PR hypotheses reject, either ex- 
plicitly or implicitly. The present enr 
phasis on sequential effects is reflected 


_} This work was supported in part by Na- 
tional Institute of Child Health and Human 
Development Research Grant HD 00949-01, 


Iso advanced that the present hy- 


only indirectly, and indeed sometimes 
not at all, by terms currently employed 
to describe PR schedules, such as per- 
centage, pattern, or fixed ratio of re- 
inforcement, etc., these terms having 
originated in different and sometimes 
contradictory theoretical frameworks. 
While such terms are not regarded as 
being particularly useful, either for 
present purposes or generally, they will 
be employed in what follows solely to 
provide a terminological bridge to the 
more appropriate sequential variables. 


PR VARIABLES 


The difficulty of achieving adequate 
experimental control in PR situations 
approaches the excessive; indeed, most 
PR investigations admit of any num- 
ber of interpretations if only because 
the groups compared differ along so 
many dimensions. The general failure 
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TABLE 1 


‘Two SCHEDULES OF REINFORCEMENT 
ATTEMPTING TO ILLUSTRATE THE 
MAJOR SEQUENTIAL AND NON- 
SEQUENTIAL VARIABLES IN 
THE PR SrrUATION 


Trial 
Schedule 
1 2 3 | 4 5 6 
A N N N N R R 
B R N R N | N R 


to come to grips with this problem has 
acted as an impediment to adequate 
and effective theorizing in the PR area. 
At the same time, it has probably led 
to the proliferation of PR hypotheses 
since, in many instances, the introduc- 
tion of a “new” hypothesis amounts 
to little more than an arbitrary em- 
phasis upon this or that PR variable 
to the exclusion of others. A more 
general orientation seems appropriate 
and for that reason the present hy- 
pothesis is considered in relation to 
the major variables in the PR area. 
The two schedules 
appearing in Table 
those employed in a wide variety of 
PR investigations, Those variables 
independent of the Sequence of N and 
R trials are here termed nonsequential 
variables, those dependent upon this 
sequence, sequential variables. Three 
of the four major nonsequentia] 


of reinforcement 
l are typical of 


vari- 
ables are illustrated in the above sched- 
ules: percentage of reinforcement 


(33% versus 50%), number of N 
trials (4 versus 3). and number of 
R trials (2 versus 3), The fourth, 
number of training trials, is Occasioned 
when an investigator attempts, as in. 
frequently occurs, to vary percentage 
while holding constant the number of 
N or R trials. The literature contains 
many investigations in which some sin- 
gle aspect of the distribution of N and 
R trials was varied with percentage of 


teinforcement and total number of ac- 
quisition trials held constant. Such 
investigations have usually varied num- 
ber of transitions from N trials to R 
trials (NR transitions) and have been 
termed patterning investigations(e.g., 
Grosslight, Hall, & Murnin, 1953)? 

It is clear that in a patterning inves- 
tigation, as defined above, the nonse- 
quential variables do not shift in level. 
While this may seem something of an 
experimental advantage, and for cer- 
tain gross purposes it is, the overall 
gain is slight since the sequential vari- 
ables are quite numerous and usually 
all of these have shifted in level along 
with the one selected for experimental 
attention. There are illustrated in Ta- 
ble 1, for example, four varieties of 
trial-to-trial transition alone: NR (1 
versus 2), NN (3 versus 1), RR (1 
versus 0), and RN (O versus 2). 
Other sequential variables illustrated 
include the probability of an R trial 
following an N trial (.24 versus .66) 
and the remaining similar probabilities, 
N following N, R following R, and N 
following R. 

According to the present hypothesis, 
the major determiners of resistance to 
extinction (Rn) are the three length 
variables : N-length, number of differ- 
ent N-lengths, and the number of oc- 
currences of each N-length. The num- 
ber of N trials occurring in succession 
without interruption by an R trial de- 
fines N-length. Schedule A in Table T 
Contains a single N-length of 4, occur- 
Ting but once; Schedule B contains two 
different N-lengths of 1 and 2, respec- 
tively, each occurring only once. 


E Investigations con 
tions were 


d hypothesis € 

i » Or extended (Ca- 
pa, Hart, & Stanley. 1963; Capaldi & 
Spivey, 1963, 1964b ; Capaldi & Stanley. 1963). 


PARTIAL REINFORCEMENT: SEQUENTIAL EFFECTS 


THEORETICAL BACKGROUND 
Stimulus Modification 


Experimental evidence indicates that 
reinforcement and nonreinforcement 
occasion distinctive stimuli, represented 
respectively as S" and S* (e.g., Bloom 
& Capaldi, 1961; Capaldi & Senko, 
1962; Capaldi & Stanley, 1963). The 
basic theoretical assumption here is that 
SN is modified by successive N trials 
and thus is the mechanism underlying 
the N-length variables. In more spe- 
cific terms, the stimulus consequent of 
a single N trial, or S™, differs either 
quantitatively, qualitatively, or both 
from the stimulus consequent of two 
successive N trials or S*«. The modi- 
fication process occurs according to a 
simple growth function so that AS* 
— 0. Stimulus modification continues 
either until an R trial occurs, in which 
event S* is conditioned to the instru- 
mental response and is replaced by s", 
or until the limit of the modification 
process is reached, a possible ent 
given great numbers of consecutive N 
trials as in extinction. 

The primary purpose of ^ 
is to demonstrate that the major find- 
ings in the area of percentage and d 
tern of reinforcement can be explaine 


on the basis of stimulus ped 
i r all soon turn to the St 
E, ires id doing 


eral literature, it is best to avo! ng 
so at present. In all but à iey S 
. investigations, variation 1m N-ler y sd 
upon which stimulus modification de- 
pends, has simply been the asc 

of the manipulation of some othe = 
variable. The better procedure 5° e 
to be to cover first, in rather extens 


: i m 
detail, two investigations xm i 
Specifically designed 3 n a pets 
mind (Capaldi, 1964). When GE 
nally published, the full pern a 
Plications of these investiga m on 


not be spelled out for o detailed 
limitation. Here, however 


this paper 
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coverage of these investigations seems 
almost mandatory, first, to make evi- 
dent certain general principles which 
should transfer without difficulty to 
the remaining PR literature, and sec- 
ond, to provide a concrete illustration 
of the way the hypothesis is to be ap- 
plied in specific circumstances, which 
should facilitate the subsequent inspec- 
tion of the general literature as well as 
indicating how the hypothesis is to be 
tested. 

Single Values of N-length 

One of the simplest theoretical cases 
involves presenting the same number 
of reinforcements for only a single 
value of SX. Two groups, one trained 
under the schedule NR, the other un- 
der the schedule NNR, illustrate the 
case; one reinforcement is provided, 
respectively, for S* and S*& As sug- 
gested previously, however, the impli- 
cations of a fixed-ratio investigation 
of this kind are far too indeterminate 
to be particularly useful for theoretical 
purposes ; the groups differ not only in 
N-length but in terms of number of N 
trials, training trials, percentage of re- 
inforcement, NN transitions, etc. 

A theoretically more useful experi- 
mental situation is provided by em- 
ploying intertrial reinforcement (ITR). 
The ITR procedure consists of placing 
the subject (S) in the baited goal box 
during the intertrial interval. As in- 
dicated elsewhere (Capaldi, Hart, & 
Stanley, 1963; Capaldi & Spivey, 1963, 
1964), an ordinary R trial is assumed 
to serve two general functions; (a) It 
conditions whatever stimuli are opera- 
tive including SY to the instrumental 
response (reinforcement function), and 
(b) it replaces the currently operative 
SN (ie, S*, etc.) with S® (replace- 
ment function). Since ITR occurs 
some time after the locomotor response 
was executed, it can be assumed not 
to serve the reinforcement function. 
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SR SM S^ Sls S S Sto Ss 


Fic. l. Assumed theoretical consequences 
of delivering the same number of reinforce- 
ments at three different values of the after- 
effect, S®, S~, and SX, (The solid vertical 
lines above S", S™, and SX represent the 
habit strength available at each of these 
stimulus points. The generalization of habit 
is also shown. The modification undergone 
by the nonreinforced aftereffect is shown 
along the abscissa.) 


However, like an ordinary R trial, it 
can serve the replacement function, 
Experimental evidence has confirmed 
these assumptions in connection with 
ITR (Capaldi et al, 1963; Capaldi & 
Spivey, 1963, 19642). 

onsider three &roups, all of which 
are trained under the five-trial sched- 
ule, R, N,, Na N,, R. Let ITR occur 
following N, in Group Sh, following 
N, in Group S~, and following N, 
in Group SX, According to the as- 
sumptions considered above, in 
S* ITR will replace Ss with SR. 


trial Similarly, in Group SN, SN 
will be conditioned to the locomotor 
response, and in Group S*, SN will 
be so conditioned. Note that by em- 
ploying ITR it has proved Possible to 
vary N-length while holding all non. 
sequential variables constant, and, pro- 
vided one considers the Schedule of 
reinforcement per se, all Sequential vari- 
ables are held constant as well. The 
assumed theoretical Consequences of 
the above manipulation are Shown in 
Figure 1. For illustrative Purposes 
only, a rather large growth factor (F 
= 1/10) is used in the positive growth 
function employed to estimate stimulus 


modification. Stimulus modification is 
shown along the abscissa of Figure 1. 
The solid vertical lines in Figure 1 
represent the amount of habit strength 
for the instrumental response possessed 
by SE, S, and Ss, respectively. The 
lines are of equal height because each 
group received the same number of 
reinforcements for the particular stim- 
ulus value conditioned, The important 
feature deserving attention in Figure 1 
is that the higher the value of S^ gain- 
ing habit Strength for the instrumental 
Tesponse, the greater the amount of 
generalized habit Strength provided to 
still higher values of Sx (ie. SN» 
etc.). Since, in extinction, the modi- 
fication process continues without re- 
versal, it follows that Rn will increase 
with an increase in the value of S¥ 
conditioned to the instrumental re- 
sponse. An ITR investigation similar 
in its logical features to the one de- 
Scribed above gave the theoretically 
expected result (Capaldi, 1964). 
Thus, it appears that Rn increases as 
N-length increases or, in theoretical 
terms, that Rn increases with an in- 
crease in the value of S* conditioned 
to the locomotor response, 


Multiple Values of N- 


Two other variables were mentioned 
as determiners of Rn: the number of 
times a given N -length occurs and the 
number of different N-lengths. On the 
assumption of a gradually increasing 


habit strength, it i. Clear that Rn will 
Increase with tl 


given SN 
When a sc 


length 


D — 
— —— 
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creasing Rn. The investigation de- 
scribed in this section is concerned with 
these predictions. 

In a recent investigation (Capaldi, 
1964), the summation assumption was 
tested employing a single N-length 
group (G3) and a multiple N-length 
group (G123), both trained in a run- 
way each day under the six-trial 50% 
Schedules appearing in Table 2. Since 
either 24, 60, or 120 acquisition trials 
Were given prior to extincton, there 
Were a total of six groups. Note that 
Since percentage of reinforcement and 
total number of acquisition trials were 
equated for the single and multiple 
N-length groups, the investigation 
would be referred to, employing typical 
terminology, as a patterning investiga- 
tion. That it contains many features 
not incorporated into the typical pat- 
terning investigation will become ap- 
Parent as the discussion develops, es- 
Pecially in later sections of the paper. 

A convincing test of the hypothesis 
that generalized SY tendencies sum- 
Mate to increase Rn requires that two 
Conditions be met. First, the highest 
reinforced value of SN must be the 
Same for all groups (e.g, S™ in the 
Present case). A longer N-length in 
the single N-length group might result 
M greater Rn in that group relative to 
à multiple N-length group simply be- 
Cause N-length is longer, and the ex- 


perimental results would be indeter- 
minate. A longer N-length in the mul- 
tiple N-length group, on the other 
hand, would certainly result in greater 
Rn for that group relative to the single 
N-length group for reasons of N-length 
and summation, and the experimental 
results would again be indeterminate. 

The second condition is that the 
highest value of S" (eg., SX: in the 
present case) must possess equal habit 
strength in both the single and multiple 
N-length groups. If such were not 
the case, the stronger S* in G3 might 
provide more generalized habit strength 
to higher valued Ss than the combina- 
tion of the three weaker SNs in G123 
(SX, SX, Ss), Indeed, as will be seen, 
this prediction was tested at the 24- 
trial level. Notice that in a 4-day 
block (24 acquisition trials), S*« occurs 
four times in G3 and only twice in 
G123. Thus we are still faced with the 
original issue which is, is it possible 
to equate habit strength at S™ yet avoid 
introducing a host of other variables— 
training level, etc.—which would tend 
to impose their own interpretations 
limitations on the results? 

The solution to this difficulty in- 
volves recognizing that habit strength 
mirrors number of reinforcements only 
to a point. That is, once habit be- 
comes asymptotic, further reinforce- 
ments will not further increase habit 
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Fic. 2. Theoretical curves. 


SNo SN SN20 SN30 gNso 


(The modification undergone by the non- 


reinforced aftereffect as a function of successive nonreinforcements is shown 


along the abscissa. Only salient stimulus points, eg, S% 


shown. 


eralization was obtained in ea 
vertical line for habit by 1.5.) 


strength. In line with this considera- 
tion, in G3 S*« received either 4 rein- 
forcements (24 trials), 10 (60 trials), 
or 20 (120 trials) prior to extinction, 
while in G123 the corresponding values 
for S*i SN: and SX were 2 each at 
the 24-trial level, 5 each at the 60— 
trial level, and 10 each at the 120-trial 
level. The growth factor (F) chosen 
for habit was .25 and for stimulus modi- 
fication .05, these F values being 
mined, as will be seen, on the b: 
previous investigations, With F — 25; 
habit becomes nearly asymptotic fol- 
lowing 10 reinforcements. Thus, at 
the 120-trial level, habit strength at 
S~: will be equal or nearly so for G3 
and G123, that is, asymptotic, and the 
summation assumption can be tested. 
The groups receiving 24 or 60 acquisi- 
tion trials are included because they 
place additional constraints upon the 
hypothesis. Most importantly, if the 
assumptions of an F or .25 and .05 for 
habit and stimulus modification, re. 
spectively, are in gross error, the hy- 
pothesis very likely will not fit simul- 
taneously the data obtained from the 
three different training levels. 
Theoretical curves illustrating the 


deter- 
asis of 


, Ss, etc, are 


Figure A depicts the theoretical situation following 24 trials of 
acquisition; Figure B, following 60 trials. 

resent the amount of habit strength availabl 
ditioned during acquisition training. The low 
amount of habit strength available at S. 
line above represents the amount of hab: 
The generalization of the various habit s 
line shows the effects of habit summatio 


The solid vertical lines rep- 
e at each value of S" con- 
er vertical line represents the 
2 for G123; the lower line plus the 
it strength available at Ss for G3. 
trengths is also shown. The dotted 
on in G123. The width of the gen- 


ch instance by multiplying the height of the 


situation following 24 acquisition trials 
are presented in Figure 2A. Since 
G123 received two reinforcements each 
at S*i SX, and SX, each has 43.75 
units of habit strength, while in G3, 
S™: has 68.36 units (four reinforce- 
ments). The dotted line in Figure 2A 
shows the effects of summation in 
G123; it can readily be seen that num- 
ber of reinforcements at S™ should 
outweigh summation as a determiner of 
Rn at the 24-trial level. Summation 
was estimated employing the equation 
Suggested by Hull (1943). According 
to the theoretical curves, the two groups 
should run about equally rapidly in the 
very early extinction trials, with G123 
showing more decrement than G3 later 
in extinction. That is, the extinction 
curves should begin to diverge follow- 
ing the early extinction trials. The 
experimental results confirmed these 
Predictions. 


Figure 2B illustrates the theoretical 


situation following 60 acquisition trials 


and Suggests that G123 should, if any- 
thing, be faster than G3 in the early 
extinction trials but slower later in 
extinction, Thus, the extinction curves 
should converge early in extinction fol- 
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lowed relatively late in extinction by 
divergence. While convergence was 
obtained, late-trial divergence was not, 
presumably because sufficient extinc- 
tion training was not given (30 trials). 
Further, as Figure 2B suggests, differ- 
ences between the groups were small 
and nonsignificant. 

Theoretical curves are not needed at 
the 120-trial level since matters are 
clear without them. At 120 trials habit 
strength at S*s will be approximately 
equal for G3 and G123 and, moreover, 
G123 will in addition have near asymp- 
totic habit tendencies available from 
S: and SX» Thus, while the groups 
are not expected to differ in the very 
early extinction trials, as extinction 
progresses G123 should show less dec- 
rement over trials than G3, that is, the 
extinction curves should diverge over 
trials. These predictions were con- 
firmed by the experimental results 
which demonstrate, as predicted by the 
hypothesis, that the habit tendencies 
available from several S*s summate to 
Increase Rn. 

The investigations considered above 
suggest that N-length, number of dif- 
ferent N-lengths, and number of occur- 
rences of each N-length are critical 
variables determining Rn. Keeping 
them in mind, it is now possible to turn 
to the major findings on percentage 
and pattern of reinforcement. Doing 
so will serve two important purposes. 


Most obviously, it can be determined 
if the N-length variables can be em- 
ployed to explain these findings. It 
can also be determined if the results 
of the investigations just considered 
are in any way incompatible with those 
contained in the general literature. 


Mayor PR FINDINGS 


Small Numbers of Acquisition Trials: 
Pattern and Percentage 


With a single exception (Spivey, 
1964), none of the experiments to be 
considered was specifically designed to 
test the present hypothesis although 
some were designed to test a more 
primitive form of it. It can be ex- 
pected, therefore, that certain inter- 
pretational difficulties will be encoun- 
tered, the most common being that one 
group usually received three or four 
times the number of reinforcements at 
some lower value of S* than another 
received at some higher value. Since 
both factors, number of reinforcements 
and SN value, operate to increase Rn, it 
will prove necessary in the small-trials 
case to present a somewhat detailed 
analysis employing theoretical curves. 
Inspection of those curves may be 
facilitated by bearing in mind a general 
rule-of-thumb from the hypothesis for 
the small-trials case: Number of re- 
inforcements will usually outweigh 
value of SX as a determiner of Rn. 
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Fic. 4. Curves illustrating the theoretical 
situation in Bacon's (1962) investigation. 
(The 70% group received three reinforce- 
ments at S^, the 50% group, one at S^: and 
one at S*s, whereas the 7076 group received 
one reinforcement at S*;) 


That is, if the desire is to produce rela- 
tively great Rn employing a relatively 
small number of acquisition trials, re- 
inforcements should be concentrated at 
a single value of SN (e.g., see G3 ver- 
sus G123 at 24 trials in the previous 
section). The theoretical curves to be 
presented employ the same Set of spe- 
cific assumptions as Previously, that is, 
F= 25 for habit and .05 for stimulus 
modification. "The applicability of these 
Specific assumptions to experiments 
from a Variety of laboratories seems 
encouraging, All small-trial investi. 


gations to be considered employed the 
runway. 


Pattern of reinforcement, 
Separate small-trial patterning investi. 
gations (18 and 27 trials, respectively), 
it was found (Capaldi & Hart, 1962) 
that a 50% alternating (A) schedule 
of N and R trials Produced greater 
Rn than a 50% irregular (1) schedule, 
In the 18-trial investigation, the A 


In two 
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group received eight reinforcements at 
S*5, the I group received two at S. 
In the 27-trial investigation, the A 
group received twelve reinforcements 
at S™, the I group received three at 
S*» Theoretical curves from which 
the greater Rn of the A group, relative 
to that of the T group, may be deduced 
are shown in Figure 3. 

Percentage of reinforcement, Fol- 
lowing 10 acquisition trials, Bacon 
(1962) found Rn to be an increasing 
function of percentage of reinforce- 
ment. An examination of Bacon’s re- 
ward schedules (personal communica- 
tion) indicated that his 70% group 
received three reinforcements at S™, his 
50% group received one at S*: and one 
at Ss, and his 30% group received one 
at S*, "Theoretical curves for these 
groups are shown in Figure 4. For 
the 30% group, the generalization of 
habit tendencies is shown extending in 
both directions along the continuum, 
and for the 50% group the summation 
of the two habit tendencies is shown 
(dotted line), In examining Figure 4, 
it should not be inferred that the pres- 
ent hypothesis expects either that the 

% group would not run in the early 
extinction trials or that the 50% group 
would not run in the later ones. Run- 
ning would be maintained to some ex- 
tent on these trials by sources of habit 
arly germane to 
al analysis (eg 
e 4 shows clearly, 
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however, that the 70% group should 
show greater Rn than either the 50% 
or 30% groups. Presumably, the su- 
periority of the 50% group in the early 
extinction trials outweighed the superi- 
ority of the 30% group in the later 
extinction trials to produce overall 
greater Rn. However, Bacon did not 
present his results in a form which 
allows a determination on this score. 
That some such complicated Trials 
X Percentages interaction may have in 
fact occurred in Bacon’s study is sug- 
gested not only by the theoretical curves 
in Figure 4 but also by the results of a 
recent doctoral investigation by Spivey 
(1964). 

Spivey employed the same 70% 
schedule (70-3) and the same 30% 
schedule (30-1) employed by Bacon, 
employing two other schedules as well 
(70-1 and 30-3). The four schedules 
are shown below in Table 3. 

The schedules are designated on the 
basis of percentage reinforcement (70 
versus 30) and number of reinforce- 
ments (1 versus 3) for some value of 
SS. Spivey found Group 70-3 to show 
greater Rn than Group 30-1, these 
results replicating Bacon's. On the 
other hand, Group 30-3 showed sig- 
nificantly greater Rn than Group 70-1. 
The upshot was that, combining 
Sroups, percentage of reinforcement 
had no effect on Rn while number of 
different SNs reinforced had a S'S- 
nificant effect. In addition, there was 
à significant interaction between trials 
and number of reinforcements for SN. 
For example, Group 70-3, while show- 
ing a greater overall Rn than Group 
30-1, extinguished at a faster rate over 
the first 10 trials of extinction, that is, 
the superiority of Group 70-3 relative 
to Group 30-1 occurred mainly in the 
early extinction trials, a result pre- 
dicted in Figure 4. Indeed, by the 
tenth extinction trial Group 30-1 was 


actually running somewhat faster than 
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Group 70-3. It should be clear, with- 
out presenting further theoretical 
curves, that the results of the Spivey 
investigation are consistent with the 
present hypothesis, as are those of the 
Bacon investigation. 


Extended. Acquisition. Training: Pat- 
tern and Percentage 


As previously indicated, following 
considerable acquisition training it may 
reasonably be assumed in most cases 
that all or nearly all values of S* con- 
ditioned to the response have an asymp- 
totic, or a nearly asymptotic, capacity 
to evoke the locomotor response. 
Thus, in the extended-trial case, as in 
any equal habit strength case, it is 
most easy to formulate a rule of thumb 
for determining which of several groups 
should show the greater Rn. Actually, 
it is the reverse of the rule applied 
to the small-trial, or unequal habit 
strength, case. For small trials, it was 
indicated that the best way to increase 
Rn was to concentrate reinforcement 
at a single value of SX. For extended 
training, the hypothesis recommends 
the opposite procedure: Spread rein- 
forcements over a number of different 
values of SY, particularly high values 
of SN, The two rules of thumb taken 
together suggest that the schedule which 
produces the least Rn for small num- 
bers of training trials should produce 
the greatest Rn for extended numbers 
of training trials, and vice versa. As 
will be shown below, this general ob- 
servation appears to be consistent with 
all available evidence. 

Pattern of reward. An example of 
the Schedule x Number of Acquisition 
Trials interaction has already been cov- 
eral in connection with pattern of re- 
ward. It may be recalled from a pre- 
vious section that G3 showed greater 
Rn than G123 at 24 trials and less Rn 
at 120 trials. Moreover, following 120 
trials Tyler, Wortz, and Bitterman 
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which is central to the present treat- 
ment. Moreover, that nonreinforce- 
ment is “registered” independently of 
a tendency toward extinction is one 
important aspect of the stimulus modi- 
fication assumption. 


COMMENTS ON Mayor FINDINGS 


Every effort has been made to cover 
a representative sample of PR find- 
ings, with respect both to training level 
and schedule of reinforcement, More- 
over, practically every investigation 
included in the literature survey has 
at one time or another been replicated. 
In the few cases where this was not 
so, the unreplicated findings are clearly 
consistent with the remaining litera- 
ture, if not actually implied by that 
literature. What have been called 
major findings are so, then, in two 
Senses, representativeness and replica- 
bility, 

While the N- 


length variables appear 
to have surviv 


ed the test of the major 
findings, it Seems unlikely on the basis 
of these findings that the nonsequential 


variables exert any but the mildest of 
influence on Rn, 


times accompanied increased Rn (Spi- 

1964; 1954, 1958) 
and sometimes accompanied decreased 
Rn (Bacon, 1962; Spivey, 1964), 
Moreover, Rn has been shown to vary 
when both percentage of reinforcement 
and number of N trials remained con- 
stant (Capaldi, 1958; Gonzalez & Bit- 
terman, 1964; Tyler et al., 1953). 
Interestingly, even a restricted claim 
for a nonsequential variah; 
ters overwhelming 
tested against the experimenta] evi. 
dence. For example, it has been sug- 
gested (eg., Bacon, 1962; Grant & 
Schipper, 1952) that at least for small 
numbers of training trials Rn increases 


le encoun- 
difficulties when 
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as number of R trials increase. „This 
restricted generalization is inconsistent 
with the results of small trial pattern- 
ing investigations (eg. Capaldi & 
Hart, 1962) and, among percentage 
investigations, with Spivey (1964) 
where following 10 acquistion trials Rn 
Was greater for a group given three R 
trials than for a group given seven R 
trials (30-3 versus 70-1). Thus, it 
would appear that the conclusions 
available from the two investigations 
covered in detail previously (Capaldi, 
1964) in which N-length, number of 
different N-lengths, and number of 
occurrences of each N-length were 
varied are adequately supported by the 
results of investigations contained in 
the general literature which, for the 
most part, were neither concerned with 
the N-length variables nor designed 
with these variables in mind. More- 
over, it would appear on the basis of 
available evidence that it is possible to 
Systematically rule out of consideration 
as major determiners of Rn practically 
all of the numerous variables contained 
in the typical schedule of partial rein- 
forcement except the N-length vari- 
ables. 

The exceptions, if such actually exist, 
are number of RR and number of RN 
transitions, According to a more ex- 
panded form of the present hypothesis, 
these variables should exert some influ- 
ence on Rn. Our impression is that 
an especially detailed analysis would 
show both variables to exert some in- 
fluence on Rn Under PR schedules. 
The need for “an especially detailed 
analysis” is occasioned by the fact that 
the far more Powerful N-length vari- 
to have obscured the ef- 
fects of these “milder” transition vari- 
ally every PR investiga- 
Be that as it may, 
RR or RN transi- 
time would occa- 


Sion a highly Speculative discussion, 
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one which for a variety of reasons is 
perhaps best avoided. 


FURTHER IMPLICATIONS 
Experimental Situations 


Ordinarily, PR hypotheses tend to 
deal with a relatively restricted class 
of experimental situation, for example, 
the present one has tended to deal 
mainly with appetitive conditioning in 
the runway. Such selectivity some- 
times occurs because the mechanisms 
of a given hypothesis plainly do not 
extend beyond the experimental situa- 
tion chosen and sometimes for other 
reasons. The most obvious and impor- 
tant issue resulting from this selectivity 
concerns the number of different sets 
of laws which need to be formulated to 
deal with all PR situations (cf. for 
example, Spence, 1960). Hopefully, 
One set of laws will prove sufficient. 

The stimulus mechanism of the pres- 
ent hypothesis seems to have à broad 
application, one which, at least in prin- 
ciple, could apply to all PR situations. 
Since the hypothesis is proposed as 
being as applicable, say, to classical 
aversive conditioning as to the instru- 
mental runway situation, the primary 
issue to be faced at present is how well 
the hypothesis fits the data in experi- 
mental situations other than the run- 
way. Our impression is that the fit 
1s good. 

For example, Jensen (1964) has 
Shown that N-length and number of dif- 
ferent N-lengths affect Rn in the free 
Operant situation (rat subjects (Ss) and 
bar pressing). It should be noted that 

ensen’s free operant results are Con- 
sistent with those of Gonzalez and Bit- 
terman (1964) who employed discrete 
bar presses, although the Jensen inves- 
tigation was by far the more elaborate. 
Jensen employed five levels of 
length (1, 15, 20, 50, and 100) and 
three levels of occurrence of each N- 
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length (1, 3, or 7) and found both 
variables to have a significant effect 
on Rn. 

Unfortunately, however, the avail- 
able data from many experimental sit- 
uations are far too unsystematic to 
allow as rigorous a test of the present 
hypothesis as would seem both desir- 
able and necessary. For example, 
Wagner, Siegel, Thomas, and Ellison 
(1964) have recently indicated that 
there are no systematic data available 
on the extinction of classical rewarded 
responses in infrahuman Ss as a func- 
tion of percentage of food reinforced 
trials during acquisition. They con- 
ducted such an experiment with dogs, 
comparing 50% with 100% reinforce- 
ment, and suggested that the extinction 
findings, while not compatible with 
alternative PR hypotheses, could be 
understood on the basis of greater 
generalization decrement in the 100% 
group “. . . as traces of nonreinforce- 
ment rather than reinforcement are first 
introduced for this group into the stim- 
ulus complex [p. 358].” Consider, too, 
a finding reported by Grant, Riopelle, 
and Hake (1950) employing human Ss 
in an aversive eyelid conditioning situ- 
ation. Grant et al. found alternating 
schedules produced greater Rn than 
irregular schedules, a finding, which 
it will be remembered, similar to that 
reported by Capaldi and Hart (1962) 
following a small number of acquisition 
trials in the runway. The Grant et al. 
finding could offer strong support for 
the present hypothesis ; indeed, we are 
aware of no other hypothesis from 
which it can be deduced. It could 
offer such support but, in one impor- 
tant respect, it does not. This is so 
because there is no direct way of de- 
termining if the number of acquisition 
trials employed by Grant et al. was 
small for the eyelid situation since, 
unfortunately, 2 comparable investiga- 


tion employing more extended acqui- 
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sition training does not seem to be 
presently available. If an extended 
trial investigation showed irregular 
patterns produced greater Rn than al- 
ternating patterns, the probability 
would be increased that the present 
hypothesis is applicable to the eyelid 
situation and perhaps to other aver- 
sive situations as well. 

Studies from the Bryn Mawr labora- 
tory employing the fish and the bird 
have shown that following extended ac- 
quisition training (400 or more trials) 
long N-length produces greater Rn 
than short N-length ( Gonzalez, Eskin, 
& Bitterman, 1963: Gonzalez, Graf, & 
Bitterman, 1965), but following a brief 
acquisition period (50 trials) long and 
short N-length produce about the same 
Rn (Gonzalez, Behrend, & Bitterman, 
1965; Roberts, Bullock, & Bitterman, 
1963). These results for fish and 
bird are similar to those for the rat 
considered in detail earlier in showing 
that the greater Rn associated with 
longer N-lengths occurs only after con- 
siderable acquisition training, that is, 
in all three organisms, fish, bird, and 
rat, there appears to be a similar sched- 
ule of Reward x Acquisition 


teraction. This corresponden 
sults between different 
particularly in view 
ber of investigation 
level of the fish and bird, seems some- 
what encouraging. Tt would appear 
that the length variables govern per- 
formance in animals at least as low 
on the phylogenetic scale as the fish, 
and moreover that the length variables 
govern performance in all three organ- 
isms in a highly similar fashion, Be- 
fore this conclusion may be accepted, 
however, further research is needed, 
particularly along two lines. First, in 
the rat long N-length produces greater 
Rn than short N-length in much 
than 400 trials (eg., 

1958; Capaldi & Hart, 


Trial in- 
ce in re- 
organisms, 
of the small num- 
s involved at the 


less 
see Capaldi, 
1962; and 
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Weinstock, 1954, 1958). This may 
mean several things, among them that 
stimuli such as SY and SF are condi- 
tioned more rapidly in the rat than in 
the fish and perhaps in the bird. If 
this conjecture is correct, it may be 
expected that in order to produce the 
Same sort of schedule of Reward x 
Acquisition Trial interaction in all 
three organisms, more acquisition 
training will be required at the level of 
fish and bird than at the level of the 
rat. It should also be indicated that 
the fish and bird investigations in which 
N length failed to increase Rn em- 
ployed long intertrial intervals (24 
hours). While it is not known 
whether long N-length will increase 
Rn in the fish and bird when a long 
intertrial interval is employed, the fol- 
lowing comment seems in order. Even 
in the case of the rat, 50 acquisition 
trials is hardly sufficient to increase 
Rn as a function of N length (e.g. 
Capaldi, 1958). Thus, it is not un- 
likely that in those instances in which 
N-length failed to increase Rn in the 
fish and bird, the failure may have 
been more related to insufficient train- 
ing than to intertrial interval. In the 
rat, of course, N-length will increase 
Rn both at short (e.g., Capaldi, 1958) 
and long (eg., Weinstock, 1954, 1958) 
intertrial intervals. Thus, another 
Possibility is that intertrial interval 
produces somewhat different effects in 
rat, fish, and bird. However, before 
this conclusion Can be established it 
Will be necessary to run N-length in- 
vestigations on the fish and bird which 
employ considerable acquisition train- 
ing as well as long intertrial intervals. 


Dependent Variables 


The theoretical 
earlier suggest that, 
Present hypothesis eve 
capable of Specifying 
tude of the response 


curves presented 


in principle, the 
ntually should be 
the exact magni- 
on such and such 
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a trial of extincton. Predictions of 
this sort are, of course, the ultimate 
aun, of behavior theory. How seriously 
this capacity for specificity in predic- 
tion should be taken at the moment is 
difficult to say. Of course, any ad- 
vance over the typical state of affairs, 
which is to predict that one group 
should show greater Rn than another 
(greater-than predictions), would be 
welcome. Quite possibly, the present 
hypothesis may be currently capable of 
delivering rate-of-extinction predic- 
ia of the general form, Group A 
ag be faster than Group B in the 
sls / extinction trials but slower in 
a: ater ones. This proposition is 
: anced because, as indicated pre- 
viously, the hypothesis has already en- 
TM some success along this line 
cepe 1964 ; Gonzalez & Bitterman, 
Fate pivey, 1964). But rate predic- 
dike are far more sensitive to the spe- 
e conditions of an experiment than 
for — ee predictions. Consider, 
een xample, Boren's (1961) investiga- 
the which showed Rn increased as 
Shines of the fixed-ratio (FR) in- 
P Boren provided "starter Te- 
(2n rCements at a very low value of FR 
2:1) before shifting some goups to 
S ed training values of FR (5:1 to 
d According to the hypothesis, 
tior 1 pretraining should produce distor- 
1 in the shape of Boren’s extinction 
curves. Under Boren’s conditions, 
this distortion should have appeared 
Mainly in the later extinction trials for 
Smaller values of FR (e.g« 5:1) and 
Mainly in the early extinction trials for 
higher values of FR (e.« 20:1). In 
Substance, when greater-than predic- 
tions are contemplated, the decision as 
2 how many reinforcements to provide 
for this or that value of SN is merely 
Important; for rate predictions, the 
decision is absolutely crucial. | 
Course, purely practical considerations 
enter here, Tt is obviously rather diffi- 
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cult to get an animal to respond con- 
sistently at FR 20:1 without prior pre- 
training at, say, FR 2:1. In this con- 
nection, Boren notes that Rn does not 
seem to be further increased by FR 
beyond 20:1, specifically citing FR 
50:1. Observations of this sort are 
helpful in determining when the limit 
of the stimulus modification process 
has been reached. Once it is reached, 
of course, further increases in N-length 
will not further increase Rn. 


Other PR Variables: Magnitude of 
Reward and Intertrial Interval 


Magnitude of reward, It has been 
shown (eg. Hulse, 1958; Wagner, 
1961) that under an irregular PR 
schedule Rn is an increasing function 
of reward magnitude. This finding 
may be understood within the frame- 
work of the present hypothesis by en- 
tertaining an assumption popularized 
and then later abandoned by Hull 
(1943, 1952) that the limit to which 
associative strength may grow is an 
increasing function of reward magni- 
tude. This assumption has been more 
or less universally rejected in recent 
times in favor of incentive interpreta- 
tions of the reward magnitude variable 
(eg. Hull, 1952; Spence, 1956). 
Accordingly, it seems unlikely that 
learning theorists will be tempted to 
abandon the currently popular incen- 
tive viewpoint for the older, seemingly 
discredited associational one in the ab- 


sence of a rather complete associational 


analysis of the magitude variable. An 


sis of this type has been presented 
(Capaldi, in press). 

It is assumed that the greater the 
magnitude of reward under which a 
particular value of SN is conditioned to 
the instrumental reaction the greater 
will be its associative capacity to evoke 
that reaction. Moreover, the greater 
the associative capacity of S* to evoke 
its reaction. the greater will be the 


analy 
elsewhere 
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amount of generalized associative 
strength supplied to still higher values 
of SN. Tt already has been shown that 
Rn is an increasing function of the 
associative capacity of the higher val- 
ues of S to evoke the instrumental 
reaction. 

The deduction that Rn will increase 
as magnitude of PR increases is ap- 
plicable without reservation only to the 
case in which the different magnitude 
groups receive the same schedule of 
PR. Under such conditions all groups 
would have the same value of S" con- 
ditioned to the reaction and each of the 
values would be conditioned with equal 
frequency; the determinate factor 
would thus be that the associative 
capacities of those SNs conditioned to 
the reaction would be greater, the 
greater the magnitude of reward, 


regular PR schedule employ: 


1 E ed was 
identical 


for the large reward and 


he present hypothesis, being a se- 
quential one, must expect that the ef- 
fects of reward magnitude w 


reaction is 
of occur- 
nitude of 


itions the 
former variable may well outweigh the 


This may be illustrated at a 
relatively simple level by considering 
two PR groups which receive a small 
number of training trials, Tt was 
shown for small numbers of acquisi- 
tion trials, it will be remembered, that 
Rn is primarily regulated by number 
of occurrences of N-length. Thus, a 
PR schedule employing small reward 
but many transitions from N trials to 
R trials could conceivably produce 
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greater Rn than one employing large 
reward and a substantially fewer num- 
ber of transitions from N trials to R 
trials. That is, SN might possess a 
substantially greater capacity to evoke 
its reaction under the former schedule 
(more conditionings of SS) than under 
the latter schedule. Experimental re- 
sults bearing on this issue are not cur- 
rently available; actually, it appears 
that no attempt has been made as yet 
to combine the sequence and magnitude 
variables in a single investigation. 
This is probably related to the fact that 
other PR hypotheses which have al- 
ready dealt with magnitude of reward, 
for example, the frustration hypothesis 
(Amsel, 1958; Spence, 1960), have 
not as yet addressed themselves to the 
Sequence variable, 
Intertrial interval, 

characteristics of stimu 
and SN are of 
ticularly si 
pretation Sheffield, 
1949) they are Stimulus traces which 
dissipate relatively rapidly. However, 
em to have been 
directly indicated previously, it is en- 
tirely possible to identify SR and SN 
as the Tesponse-produced stimuli of a 


mediating reaction In this 


The temporal 
li such as S® 


As 


influence the type of 


; ults obtained under 
R since, by definition, stimulus traces 


tervals (ITT) and inoperative at 
longer ones. It was on the basis of 
cinstock’s (1954 
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st and S* as stimulus traces was first 
Tejected since PR was shown to in- 
crease Rn despite the employment of 
a 24-hour ITI. More recently, a num- 
ber of investigations from our labora- 
tory (Capaldi & Spivey, 1964a, 1964b ; 
Capaldi & Stanley, 1963; Capaldi & 
Wargo, 1963) have supported Wein- 
Stock's conclusion in showing that a 
variety of PR variables produces ap- 
Proximately similar results under long 
and short TTIs. 

However, rejecting the idea that S" 
and SN are stimulus traces is not to 
reject the idea that they regulate per- 
formance under PR. Tt seems rela- 
tively clear that if, as has been at- 
tempted in this paper. one can supply 
in terms of S* and SY a consistent 
explanation for massed and spaced trial 
PR data, then it is justifiable to as- 
Sume that such stimuli are functional 
at long ITIs as well as short ones. 
Tn this event, as indicated, one plausi- 
ble alternative is to view S? and Sh 
as the response-produced stimuli of a 
mediating reaction. By doing so, how- 
ever, we appear to commit ourselves to 
a somewhat different view of mediating 
reactions than has prevailed in animal 
learning since Hull’s (1930) original 
Paper on the topic. This may be 
clarified by examining two contrasting 
Properties normally assigned to stimu- 
lus traces and mediating reactions. 

The stimulus trace is considered to 


dissipate relatively rapidly, on the = 
hand, and to be more or less CO dj 
urrent rewar 


pletely responsive to the ¢ 
Condition, on the other. 

9f completely responsive i$ 
= the current trial is nonre 
race on the succeeding tra’ Y 
that of ee The mediating Te- 
action is normally assigned somewhat 
Opposite properties. It is, dE i 
Considered to be long-lived. More 
Over, it is considered to be only par. 
tially responsive to the current rewar 


The meaning 
as follows: 
warded the 
jal will be 
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outcome. Thus, if rz, is strongly es- 
tablished, a single nonrewarded trial 
will reduce its strength through extinc- 
tion only slighlty (e.g., Logan, Beier, 
& Kincaid, 1956). 

According to the current PR hy- 
pothesis neither of these mechanisms 
is entirely satisfactory. What the pres- 
ent hypothesis requires is a stimulus 
which is like that produced by the me- 
diating reaction in being long-lived but 
which is also like that produced by the 
trace mechanism in being more or less 
completely responsive to the current 
reward outcome. The former require- 
ment, as already indicated, is dictated 
by the results of spaced-trial PR in- 
vestigations, the latter requirement by 
the results of practically all PR inves- 
tigations covered in this paper. In 
numerous places throughout this paper 
it was asumed that S® and SN were 
more or less completely responsive to 
the current reward outcome. In the 
absence of that assumption the present 
interpretation would not be feasible. 

Considerations advanced in previous 
sections of this paper suggest that the 
notion of what might be called a com- 
pletely responsive mediating reaction 
can be applied to extinction. Can it 
be applied to acquisition? The tradi- 
tional or partly-responsive view of me- 
diating reactions entertains two as- 
sumptions with acquisition: That fg is 
formed slowly and that some number 
of trials are required to condition the 
stimulus it produces (sg) to the instru- 
mental reaction (eg., Amsel, 1958). 
It appears possible to reject the former 
assumption and accept the latter one 
and still retain the capacity to deal with 
current acquisition data in mediation 
terms. That is, it may be assumed that 
the mediating reaction develops quickly 
but that the stimuli it produces are 
conditioned to the instrumental reac- 
tion rather slowly. This is to say in 
effect that expectancy reactions, that is, 
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Tg, are conditioned to their stimuli at 
a much more rapid rate than instru- 
mental reactions are conditioned to their 
stimuli. At a common sense level, the 
present view suggests that the animal 
quickly learns about reward or non- 
reward or punishment in the experi- 
mental situation but that a number of 
additional trials are required before it 
learns what to do about its diffuse ex- 
pectations. At a more formal level, 
the present view suggests that expect- 
ancy or mediating reactions and instru- 
mental reactions are conditioned at 
different rates. 
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THEORETICAL NOTE 


REINSTATEMENT? 


BYRON A. CAMPBELL anp JULIAN JAYNES 


Princeton University 


man early experience exerts a multiplicity 
of effects on adult behavior (Beach & 
Jaynes, 1954; Scott, 1962), 


S perceptual 
And in still 
is a critical 


In this Paper we suggest yet another 
mechanism. Although obvious and dis- 
armingly simple, 
authors of such ne. 
to warrant this not 


term for it. Y reinstatement we denote 
a small amount of Partial practice or 
repetition of an experience Over the de- 
velopmental period which is enough to 
maintain an early learned response at a 
high level, but is not enough to produce 
any effect in animals Which have not 
had the early experience. The following 
experiment is meant as a demonstration 
of this phenomenon in a commonly studied 
instance of learning. 


METHOD 


The subjects were 30 albino rats of the 
Wistar strain born and raised in the Prince- 


1 This research was supported in part by 
Public Health Service Grant M-1562 from 
the National Institutes of Mental Health and 
by National Science Foundation Grant GB 
2814. 


ton colony. They were divided into three 
groups of 10 each, with an equal number of 
males and females in each. The apparatus 
used was one commonly used in fear experi- 
ments (Campbell & Campbell, 1961). It 
consisted of two compartments separated by 
a door, a black one with a grid floor, and a 
white Compartment with a solid metal floor. 
Shock could be administered to the grid of 
the black compartment. To two of the three 
groups an early fear-arousing experience 
Was given in the black compartment. This 
consisted of Placing the rat just after wean- 
ing, when approximately 25 days old, on the 
grid side of the apparatus with the door fixed 
so that the rat could not escape, then giving 


the rat 15 2-second 170-volt shocks on a 


20-second variable intery, 


» and then repeating this entire 
Procedure once, Thus each animal received 
a total of 30 shocks, At the end of this 
Period the rat was removed and placed in a 
home cage. A third control group was run 


without any shock 
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half on the safe side. Otherwise this pro- 
cedure was precisely the same as the training 
Procedure except that only 1 instead of 30 
shocks was administered. The second pre- 
trained group was given the same procedure 
Beret that no shock was administered. 
i a week after the third reinstatement pro- 
*dure, when the animals were 53 days of 
dn they were all tested for the effects of 
pie early experience. This was done by 
Placing them individually in the black com- 
cent (where all of them had been 
Shocked at one time or another) with the 
frad removed so that the animal could run 
imi" mto the white compartment. The 
the ‘ee in the white compartment over 
isuing hour was then recorded. 


Resutrs AND DISCUSSION 


5 The results were unequivocal. As seen 
a, Figure 1, the group that had received 
in early fearful experience followed by 
Mee 2-second shocks administered at 
Weekly intervals, spent an increasing 
Percentage of its time in the white com- 
ipstüent during the l-hour test period, 
lus showing the effects of the early fear- 
z experience with the black compart- 
tent. In contrast, the group that had had 
henge carly experience just after wean- 
feet no reinstatement of it in the 
Si Crvening month, failed to show any 
s Snificant fear of the black compartment, 
ene on the average all but about 10 
aie of the hour on that side. Simi- 
early the group which had not had any 
Cei Y traumatic experience, but had = 
ies Uie three brief shocks Over te 
ea failed to acquire any significant 
diffe Of the black compartment. E 
t erence between the first group and th 
th lér two groups is, as it appears on 
* graph, highly reliable statistically (P 
:01, Mann-Whitney U test). . 
in here is nothing dramatically ua 
* 8 about this finding. It is indeed wl 
Nyone thinking carefully about bro 
Practice would expect, namely, tiat 
aye 1S some small amount of "pe 
m X" Certain time intervals which c! z 
antain a previously learned respons 
ani Yet not be enough to tram naive 
en ed to perform that response, Ae 
8 ater ility that this mechanism 9 A 
‘ment has wide and important appi! 


ther, 
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Fic. 1. The effect of reinstatement of early 


fear on later behavior. 


cability in the ontogeny of behavior in 
many vertebrate species seems beyond 
question. 

In theoretical analyses of human 
growth and development traumatic events 
in infancy and childhood have long occu- 
pied a central, if controversial, role. In 
Freud's early analyses, traumatic events 
in childhood were considered a major 
cause of adult behavior disorders. With 
time, this view Was gradually modified 
such that White writing in 1956 summed 
up current opinion by stating: 


ubtedly it is true that some adult neu- 
roses have their origin in violently fright- 
ening events. . - - The theory has long since 
been abandoned, however, that all neuroses, 
or even a majority of neuroses, take their 
start from traumatic events [1956, p. 238]. 


y trauma theory has incon- 
th certain facts of memory 
E rning as well First, on a mere 
s hanomenological level, we know that 
memory becomes more and more dim the 
further back into our childhood we try to 
remember. Second, in rats, the earlier 
in life that a fearful experience is given 
the animal, the more likely it is to be 
forgotten in adulthood (Carapheil & 
Campbell, 1962). Third, in chickens, 
the earlier in the critical period that the 
hick is imprinted, the more likely it is 
i in forgotten when the animal reaches 
sd juvenile stages (Jaynes, 1957). This 


Undo 


The earl 
sistencies WI 
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nized “sleep” patterns, it was 
eae that this was due to the 
removal of the trigeminal and vestibu- 
lar tonic barrage. 
This view of sleep was prevalent 
until Moruzzi and , Magoun (1949) 
showed that stimulation in parts of the 
midbrain RF of a cat put to sleep by 
barbiturates or chloralose yielded a de- 
synchronized EEG. They interpreted 
their results as showing that the middle 
core of the midbrain contains a struc- 
ture that diffusely arouses the organism 
and maintains it in an alert state. 
Their inference was based on the as- 
sumption that the  desynchronized 
EEG represented an awake pattern. 
If their interpretation were correct, 
then it should have been possible to 
eliminate any form of alertness in a 
chronic preparation by removing the 
structure. This is what Lindsley, 
Schreiner, Knowles, and Magoun 
(1950) did. They showed that cats 
with massive lesions in midbrain RF 
were asleep or comatose during the 
postoperative period. They also 
Showed that lesions in lateral areas, 
involving the primary sensory path- 
ways, did not produce these effects, 
From these studies and subsequent ex- 
periments (for reviews see French, 
1960; Lindsley, 1960; Magoun, 1963; 
Samuels, 1959) the following picture 
of the brain mechanisms underlying 
sleep emerged. ^ Bremer's cerveau 
isolé preparation showed a Synchro- 
nized sleep pattern because of the sever- 
ing of the RF from the cerebrum. The 
encéphale isolé preparation could show 
a desynchronized pattern in the absence 
of spinal inputs because the RF was 
still capable of bombarding the cortex, 
It was also pointed out that the activity 
of the RF was maintained by collaterals 
of primary sensory pathways into the 
RF (Starzl, Taylor, & Magoun, 1951), 
Thus sleep was considered a passive 
process, brought about by the inactivity 
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of the RF, presumably caused by the 
reduction of sensory inputs into that 
Structure. 


Recent Data on SLEEP 


Aserinsky and Kleitman (1955) 
noted that during a night's sleep there 
were periods in which Ss showed rapid 
eye movements (REM). These pe- 
riods of REM occurred about every 
60 minutes for 20-30 minutes; the 
duration of REM was longer in the 
later hours than in the beginning of 
sleep. Dement and Kleitman (1957b) 
Showed that these periods of REM 
were related to periods in which S was 
most likely dreaming. They roused 
their Ss both during periods of REM 
and periods where there were no rapid 
eye movements. They showed that the 
amount of dream recall was much 
greater during REM periods. These 
authors interpreted their results as 
demonstrating that dreaming occurred 
during REM. It should be pointed 
out, however, that dream recall is pos- 
sible during other stages of sleep 
(Foulkes, 1964), Dement and Kleit- 
man (1957a) also showed that during 
these REM periods the EEG showed 
a desynchronized pattern, similar to 
that seen in the waking human! Al- 
though this same result had been seen 
by Derbyshire, Rempel, Forbes, and 
Lambert ( 1936), interpretation was 
difficult since it was not clear whether 
these dreaming Ss were sleeping or 
were, in fact, awake, Tf sleeping, why 
was their EEG record similar to that 
seen in the waking $? If awake, how 
were they dreaming ? 

Dement (1958) investigated these 
problems by studying the EEG pat- 
terns of the cat chronically implanted 
with recording electrodes, He kept the 
animal awake for 2 days, then fed it 
With warm milk and food, placed 
S, with electrode leads attached, in a 
quiet chamber, and waited for S to fall 
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asleep. He found that there was a pro- 
gressive synchronizing pattern as S 
went to sleep, until S showed a com- 
pletely synchronized EEG. Following 
this, desynchronized EEG patterns sud- 
denly appeared. The REM were seen, 
as well as twitching, and other small 
motor movements. This period was 
called Stage 1-REM. The progres- 
Sively more synchronized stages of the 
EEG were called 2, 3, and 4. Follow- 
mg 1-REM, Stages 2, 3, and 4 suc- 
Cessively appeared, and then the cycle 
began again. This cycle was repeated 
throughout the night. Dement (1958) 
also attempted to awaken S during the 
various stages of the EEG. He found 
that it was more difficult to do so dur- 
ing Stage 1-REM than during any 
Other stage. Dement identified this 
Stage 1-REM as sleep, but was not 
certain whether it stood between wake- 
fulness and sleep or represented a very 
deep stage of sleep. One could assume 
that this was a light stage of sleep, and 
that the reason it was difficult to arouse 
S was that Ș was distracted, attending 
to something else, most probably its 
dream. On the other hand, one could 
assume that this was a deeper stage of 
Sleep than synchronized sleep, and that 
the reason § was not aroused was sim- 
Ply because of its depth of sleep. The 
man who has championed this latter 
View is Michel Jouvet. It is his work, 
and that of the Pisa school, along with 
these pioneering studies of Dement that 
have suggested revisions of certain no- 
tions concerning mechanisms of sleep 
and EEG. Before continuing, it will 
= wise to interrupt this history and 
briefly mention some problems in the 
Interpretation of the EEG. 


EEG iw Steep AND WAKEFULNESS 


Until recently, the early EEG work 
of Rheinberger and Jasper (1937) re- 
mained more or less accepted. These 
Investigators as well as others (Rossi 


& Zanchetti, 1957, P. 307, for refer- 
ences) had shown that the EEG ex- 
hibited several patterns which paralleled 
certain behavioral states. The low- 
voltage, high-frequency desynchronized 
pattern was often termed an aroused 
or awake EEG, while the high-voltage, 
low-frequency EEG pattern was often 
called the sleep EEG. It now appears 
that there are cases in which this paral- 
lelism does not stand up. The work of 
Dement, for example, suggests that it 
is possible to obtain a desynchronized 
awake EEG during sleep. 

Jouvet (1962) has shown that the 
anticholinesterase eserine increases the 
desynchronized EEG pattern seen in 
the sleeping animal. It has also been 
possible to demonstrate a synchronized 
EEG pattern while S is fully awake. 
For example, Wikler (1952) and Brad- 
ley and Key (1958) have shown that 
Ss injected with atropine show a syn- 
chronized EEG, but show behavioral 
alertness. Feldman and Waller (1962) 
have demonstrated the same result with 
selective brainstem lesions. In sum, 
desynchronized or low-voltage fast ac- 
tivity (LVF) may be associated with 
either sleep or wakefulness, and syn- 
chronized or high-voltage slow activity 
or slow waves (SW) may likewise be 
associated with either sleep or wake- 
fulness. 

These considerations do not elimi- 
the EEG as a useful tool in brain 


nate A > 
do make it quite 


physiology, but they 
clear that: 

1. The use of the terms arousal pat- 
tern and sleep pattern in characterizing 
certain EEG outputs is sometimes in- 
correct. The substitution of the terms 
desynchronized, or low-voltage fast ac- 
tivity, and synchronized, or high-volt- 


age slow activity or slow waves, seems 


preferable. 
2. It is inadequate to use EEG rec- 


ords alone to infer level of conscious- 


ness without reference to the behavior 
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of S as well as other independent physi- 
ological measures. One of the major 
problems associated with the research 
on sleep is that of describing the state 
of awareness of S, since it is now clear 
that the EEG alone cannot be used, 
and that converging operations (Gar- 
ner, Hake, & Ericksen, 1956) are nec- 
essary to demonstrate the presence or 
absence of consciousness. It would 
seem from these considerations that, 
rather than considering the EEG a use- 
less neurophysiological tool, it would 
be more profitable to investigate the 
complexity which has been thus intro- 
duced in order to clarify those mecha- 
nisms underlying the synchronization 
and desynchronization of the EEG seen 
both during wakefulness and sleep. 


Work or Jouver 


Originally published in French 
(Jouvet & Michel, 1958), Touvet's 
work has been reported in English and 
French at several symposia (Jouvet, 
1961a, 1961b, 1963) ; the most exten- 
Sive. summary of his work (Jouvet, 
1962) has been reported in French. 
Essentially, Jouvet has continued the 
work of Dement (1958) and has 
elaborated the possible telencephalic 
(cortex, hippocampus, Septal area, 
etc.), mesodiencephalic (thalamus, hy- 
pothalamus, midbrain RF), and rhom- 
bencephalic (pons, medulla) mecha. 
nisms that may be involved in the 
production of sleep. His experimental 
evidence and bold interpretations of 
data have made this area of research 
important in attacking many problems 
concerned with the organization of the 
central nervous system during sleep 
and waking. " 3 

Jouvet has worked mainly with cats, 
chronically implanted with electrodes 
in cortical, subcortical, and brainstem 
areas. He finds that there are basically 
two stages of sleep, which he calls 
“slow” and “fast” sleep. The former 
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is sometimes called telencephalic and 
the latter rhombencephalic, to refer to 
the general anatomical area hypothe- 
sized to mediate the two stages. Dur- 
ing the slow stage of sleep (S-SW), 
the EEG shows spindles (12-18/sec- 
ond) followed subsequently by SW 
(2-4 cycles/second) which appear in 
cortex, thalamus, and mesencephalic 
and pontine RF. During this time, the 
hippocampus shows high-voltage spikes. 
The electromygraphic potentials are 
present, and the midbrain RF stimulus 
necessary to wake S is 20-30% greater 
than during falling off to sleep. Dur- 
ing sleep with low-voltage fast activity 
(S-LVF), or the paradoxical or rhom- 
bencephalic stage of sleep, there is low- 
voltage desynchronized activity, similar 
to that seen in the waking state. The 
hippocampus shows the arousal pat- 
tern of theta activity originally de- 
scribed by Green and Arduini (1954). 
In pontine RF, and more precisely in 
n. reticularis pontis caudalis (RPC), 
high-voltage waves (8/second) in the 
form of spindles are seen to occur. 
This 8/second activity is also seen in 
the interpeduncular nucleus and the 
posterior hypothalamus. While the 
electrographic pattern is in many re- 
spects like that of the waking S, the 
behavior of S, as well as certain physio- 
logical measures, indicate that S is 
asleep. Jouvet points out that at the 
outset of S-LVF there is a marked 
atonia; the head drops, and the elec- 
tromyogram (EMG) of the nuchal 
muscles of the neck is flattened. As 
compared with the S-SW, respiratory 
movements during S-LVF are irregu- 
lar, and the cardiac cycle may show 
either a distinct acceleration or a dis- 
tinct deceleration, Eye movements are 
lateral and vertical, rapid and “explo- 
Sive." There is twitching of the tail, 
vibrissae, and ears. Fast sleep always 


follows slow sleep and does not arise 
first. 


NEURAL MECHANISMS OF SLEEP 


Further comparisons of the slow and 
fast phases of sleep reveal that: (a) 
The threshold of arousal by either mid- 
brain RF or peripheral sensory stimu- 
lation is appreciably raised during 
S-LVF; (b) evoked potentials are 
lowered during this period; and (c) 
S-LVF occupies 30-40%, while S-SW 
Occupies 60-70% of the sleeping time 
of the cat. 

Having described the major charac- 
teristics of sleep in both the slow and 
fast stages, Jouvet then proceeded to 
determine the anatomical loci responsi- 
a for these effects. Concerning the 
ocus of slow sleep, Jouvet found that 
total neodecortication prevented the ap- 
Pearance of S-SW, but did not inter- 
fere with the occurrence of S-LVF. 
a there was any cortex remaining, then 
S-SW appeared. Concerning the locus 
of S-LVF, Jouvet found that sections 
in front of the pons prevented its ap- 
Pearance without preventing the ap- 
parante of S-SW, while sections be- 
"Ind the pons did not interfere. with 
cither the slow or fast phase of sleep. 
‘casoning that the structure mediating 
Sota was in pons, Jouvet made dis- 
ns electrolytic lesions in various C€ 
esses within the pontine tegmentum. 
n e found that lesions of RPC dimi- 
a S-LVF, but did not affect S-SW. 
d these studies it was suggested 
nA S-SW is brought about by cortica 

uences, and that sleep wit desyn- 
chronized EEG, or rhombencephalic 
«€ (S-LVF), is triggered by 54 
: J Ouvet also performed several lesion 
qaectiments in which he obtained x 
iara against the notion that [bci 
modu ronization seen during E This 
esi lated through midbrain RF. : zw 
isl Ons in dorsal tegmentum tha ma 
3 Ted the classical EEG arousa r à 
ite to nociceptive stimuli did no 
lish the desynchronizing action seen 


ig S-LVF. Touvet has further 
'Own that lesions in ventral tegmen- 
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tum, posterior hypothalamus, inter- 
peduncular nucleus, and septal region 
may reduce or eliminate S-LVF. All 
of these structures are included within 
the limbic midbrain circuit of Nauta 
(1958). Thus Jouvet has attempted 
to make a distinction between the de- 
synchronization seen during arousal 
and the desynchronization seen during 
sleep. He has also stated that the 
EEG of these two states looks different 
to the “trained eye.” 

These findings may be related to 
those of Olds and Peretz (1960) and 
Glickman, Sroges, and Hunt (1964). 
The former found greater self-stimula- 
tion from midbrain RF electrodes lo- 
cated in the ventral region than in the 
dorsal region, and greater lower thresh- 
old arousal responses from more lat- 
eral and dorsal areas. Glickman et al. 
(1964) found that lesions in the ven- 
tral area increased activity markedly, 
while lesions in the more dorsal areas 
had no such effect. While it is not 
clear what common element 
t S-LVF; self-stimulation, and 
exploratory behavior, the consistent 
differences between the dorsal and ven- 
tral RF with reference to these be- 
haviors make the investigation of this 
problem worthwhile. As a possible 
working hypothesis one could assume 
that all three behaviors are related to 
some reinforcing Lotti That 

£ may be reinforcing 15 sug- 
a by ^ irritability of Ss de- 
ead of Tit A (Dement, IW: 
Along these lines, it is of interest to 
speculate whether septal rage (Brady 
& Nauta, 1953) may be at all ascribed 
to deprivation 0 S-LVF. These dd 
ibilities have not been investigated. 
si Two major conclusions can be 
Pec] from the Jouvet experiments. 

i it would seem fairly safe to say 

First, 1 deeper stage of sleep 


-LVF isa 
that S-LV. Second, it appears that 


active process 


as yet 
exists i 
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rather than a passive deafferentation. 
With respect to the first conclusion, 
Winters (1964) has suggested that 
S-LVF represents a stage of distrac- 
tion rather than deep sleep and is thus 
intermediate between wakefulness and 
S-SW. Winters has pointed out that 
the major claim of Jouvet is the de- 
creased responsiveness to peripheral 
stimuli during S-LVF. According to 
Winters, this decrease may be caused 
either by an increased excitability of 
neurons (occlusion) or a decreased 
excitability. Since Jouvet has not 
clarified this point, he cannot make 
inferences about the state of depth of 
sleep. It seems that Winters is as- 
suming that depth of sleep is directly 
related to frequency of unit firing. 
There does not seem to be any evi- 
dence to warrant this assumption, 
Also, Jouvet stimulated midbrain RF 
directly and compared arousal thresh- 
olds at various stages of sleep. Winters 
does not mention these findings. Win- 
ters has also Pointed out that the dis- 
appearance of the EMG is not sufficient 
evidence to define depth of sleep. 
Rossi and his Co-workers (Candia, 
Favale, Guissani, & Rossi, 1962) have 
shown a marked drop in blood pressure 
during S-LVF.  Giaquinto, Pompei- 
ano, and Somogyi ( 1964) have shown 
a complete abolition of the hetero- 
nymous afferent reflex and the reduc- 
tion of several other spinal reflexes 
during S-LVF. These effects do not 
occur as markedly in S-SW. In addi- 
tion, petit mal epilepsy is depressed 
during S-LVF (Cadilhac, Vlahovitch, 
& Delange, 1965), suggesting that this 
stage of sleep may be. deeper than 
S-SW. Thus, while Winters (1964) 
may be correct in saying that EMG 
disappearance or (if he knew of these 
data) blood-pressure drop or spinal 
reflex depression is insufficient evi- 
dence to demonstrate the depth of 
sleep, it would appear that all three 


lines of evidence taken together 
strongly favor the view that S-LVF 
is a deeper stage of sleep than S-SW. 
The work of Jouvet and his major con- 
clusions have been supported by the 
reports of Hubel (1960), Rossi, Fa- 
vale, Hara, Guissani, and Sacco 
(1961), and Moruzzi (1963a). See 
also the reviews by Cordeau (1962) 
and Moruzzi (1963b, 1964). 

The second major conclusion is that 
the induction of sleep is probably an 
active process. The work of Hess 
(1957) and others (Rossi & Zanchetti, 
1957, pp. 387-390) had shown that 
sleep could be induced by stimulation 
of particular brain loci. These studies 
were taken to be evidence for an active 
sleep system. It has remained a prob- 
lem, however, to obtain this effect with 
frequencies greater than 15/second. 
It has therefore been argued that the 
induction of sleep is not related to the 
area stimulated but rather to the char- 
acteristic of the stimulus frequency. 
While this argument remains cogent 
for that body of evidence, it cannot 
refute the data of Jouvet which show 
the absence of sleep when certain struc- 
tures are destroyed, Also, stimulation 
at high frequencies in pontine struc- 
tures can bring about sleep (Jouvet, 
1962; Rossi et al, 1961). It would 
appear quite safe to conclude that sleep 
is brought on by an active process. 
The next group of studies that we will 
consider will show some of the ana- 
tomical substrates underlying the syn- 
chronizing and desynchronizing pat- 
terns seen in the EEG. 


Work or MORUZZI AND THE 
Pisa Group 


At about the same time as Jouvet 
Published his work on the sleep pat- 
terns of the cat, Batini, Magni, Pales- 
tini, Rossi, and Zanchetti (1959), Ba- 
tini, Moruzzi, Palestini, Rossi, and 
Zanchetti (1959), and Batini, Palestini, 
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Rossi, and Zanchetti (1959) presented 
their work on the midpontine pretri- 
geminal preparation. A preliminary 
report was published in this country 
(Batini, Moruzzi, Palestini, Rossi, & 
Zanchetti, 1958). The major tech- 
nique employed by these workers is 
similar to that of Bremer—namely, 
transection of the brain at various 
levels, and analysis and comparison 
of the EEG related to the various 
transections, They have found that 
transections in front of the trigeminal 
nerve give either persistent desyn- 
chronization or synchronization (spin- 
dles and SW) depending on the angle 
of the transection. Thus in the rostro- 
pontine pretrigeminal preparation, the 
EEG is predominantly synchronized, 
While in the midpontine pretrigeminal 
Preparation the EEG is desynchronized. 
t may be asked whether the desyn- 
Chronization seen in the midpontine 
Preparation indicates arousal oT an 
animal in S.LVF. Several experi- 
ments (e.g., Affanni, Marchiafava, 
Zernicki, 1962a, 1962b; King & Mar 
chiafava, 1963) show conclusively 
Is awake. 
iouis results hav F 
M ications, First, it had P 
n assumed that the 


that 


e several important 
reviously 


tran Capacity to arouse is retained mi d 
Hier Con is made rostral to the tere he 
boe at which time the sleep-stat pan 
Somat, S0lé is induced. Clearly, therefor 
Bowes sensibility from the head is aUe 
tati erful contributor to tonic reticu ar ae 
on than are all other input SYS 


[French, 1960, p. 1288]. 


The work of Batini, Magni. 
Ossi, and Zanchetti (19 js jue 

hei (igi pes =a and Zan 

59), an atini, 

rossi, and eia (1959) has shown 
Des it is possible to achieve per: we 
Mae of desynchronization 

flerentation of the trigen 

is does not exclude the P 


nal nerve. 
ossibility 
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that the trigeminal contributes to the 
activation of RF, but trigeminal input 
is certainly not necessary for EEG 
desynchronization patterns. In fact. 
Batini, Palestini, Rossi, and Zanchetti 
(1959) have shown that while desyn- 
chronization in the midpontine pretri- 
geminal preparation disappears when 
the other two sensory inputs (optic, 
olfactory) are severed, even in this case 
desynchronization patterns begin to ap- 
pear after 2 or 3 days. These authors 
have concluded that in the “midpontine 
pretrigeminal preparation EEG activa- 
tion patterns are independent of the 
sensory inflow [p. 32].” 

The above findings suggest revisions 
ticular theorists since it has been 


for re s 
repeatedly emphasized that EEG acti- 
vation is brought about by sensory 


inputs and is maintained by cortico- 
fugal fibers to RF. But Morillo and 
Baylor (1963) have found that the 
medial lemniscus, spinothalamic tract, 
and spinocerebellar tract do not give off 
collaterals to the midbrain RF. Inas- 
much as these major sensory pathways 
have, at times, been claimed to do so, 
much work must be done to clarify 
exactly what inputs reach the RF and 
to what extent such inputs innervate 
this structure. - 
Tt has been mentioned that one im- 
lication of the work of Batini, Magni, 
Palestini, Rossi, and Zanchetti (1959) 
was the demonstration of the noncriti- 


re of the trigeminal in the de- 
en of the EEG. The 


synchronization n 
"uestion of what structure maintained 
‘he persistent desynchronization still 


remains. Since a section at the mid- 
prain pontine border oromni 
Jed to synchrony» and a lesion at : ` 
middle of the pons (midpontine) S 
to persistent desynchrony, it wou 
the tissue between these two 
necessary for the persistent 
If this were so, it 


nization. sw 
a clear whether this tissue rep- 


seem that 
cuts was 


488 ARYEH ROUTTENBERG 


resents (a) an extension of the mid- 
brain RF, or (b) a separate structure 
independent, to some extent, of the 
midbrain RF. If the former, then the 
obtained result could be explained on 
the basis of mass action; that is, a 
certain amount of midbrain and pontine 
RF is necessary to desynchronize cor- 
tex. Removal of some of this structure 
will decrease its ability to desynchro- 
nize. If the latter, it may be possible 
that the pontine structure is more 
critical than midbrain RF in maintain- 
ing wakefulness. The pontine struc- 
ture which has been tentatively identi- 
fied is n. reticularis pontis oralis. 
Before continuing it is necessary to 
point out a persistent problem with 
lesion research, particularly experi- 
ments in which entire sections of the 


brain are prevented from influencing 
one another. 


DENERVATION S UPERSENSITIVITY 


Psychologists and physiologists have 
generally ignored the problem set forth 
by Stavraky (1961). When a brain 
structure is removed by an experimen- 
tal or vascular lesions, the structures 
which were innervated by the removed 
structure will change their properties, 
The change in property will usually be 
in the form of an increased chemical 
sensitivity to other incoming stimuli 
(either naturally or experimentally ap- 
plied). This finding has been demon- 
strated for peripheral structures (Can- 
non & Rosenbleuth, 1949), but it has 
not been conclusively shown that de- 
nervation supersensitivity exists in the 
central nervous system. Sharpless and 
Halpern (1962) reported that after 
several weeks of little spontaneous ac- 
tivity chronically isolated cortex some- 
times showed EEG patterns which “re- 
sembled that recorded from normal cor- 
tex in waking animal [p. 246]." These 
results, while only suggestive, make 
possible another interpretation of the 


Batini, Palestini, Rossi, and Zanchetti 
(1959) finding that cortex can still 
show a desynchronized pattern without 
sensory inputs. Perhaps the desyn- 
chronization seen 2 days postopera- 
tively in the midpontine pretrigeminal 
totally deafferented preparation is not 
due to the intrinsic activity of the RF 
alone, but rather is potentiated by a 
generalized supersensitivity resulting 
from denervation of all sensory inputs 
to the brain. Whatever the state of 
affairs, it is quite clear that the impor- 
tance of denervation supersensitivity 
has not been adequately realized (but 
see Harvey & Lints, 1965), perhaps 
because it can make the interpretation 
of any neurophysiological experiment 
in which a lesion is made extremely 
difficult. There is little data on this 
issue. Perhaps subsequent experiments 
will take this phenomenon into account. 


SYNCHRONIZING MECHANISMS 


, In the Batini experiments conducted 
in 1959, it may not only be fruitful to 
ask what mechanisms are directly re- 
sponsible for the presence of desyn- 
chronization, but it may also be worth- 
while to ask what mechanism might 
underlie the persistence of desynchro- 
nization. If it were assumed that there 
Was a synchronizing mechanism in cau- 
dal pons that inhibited rostral pontine 
and midbrain RF, then one could ex- 
plain the results of the Pisa group by 
assuming that in the midpontine pre- 
trigeminal Preparation, the synchroniz- 
ing mechanism is cut off from the 
activating structures in pons and mid- 
brain. Thus, persistent desynchroniza- 
tion would be obtained by release from 
inhibition. This view is supported by 
different workers in different labora- 
tories using a variety of techniques. 
The main conclusion that emerges is 
that there is a powerful inhibitory in- 
fluence being exerted hv a synchroniz- 
Ing mechanism located in caudal pons 


A 
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and perhaps rostral medulla. Also, the 
presence of such a mechanism rein- 
forces the suggestion made earlier that 
Ma is brought about by active in- 
hibitory mechanisms and essentially is 
not occurring passively. 

The first major suggestion that there 
was a synchronizing influence on corti- 
cal activity was made by Cordeau and 
Mancia (1958, 1959). They studied 
the EEG of cats that sustained mid- 
brain hemisections. These authors 
found that in the awake cat there was 
greater EEG synchronization in cortex 
on the same (ipsilateral) side as the 
midbrain section as compared to the 
ee side, During the first 
ew postoperative days, when S was 
Dip to sleep by pentobarbital or went 
in sleep naturally, synchronization or 
Sleep spindles appeared first on the side 
ipsilateral to the lesion. After 
he ere days, the asymmetry of 
ig EG in the two hemispheres v 
is he awake S was gone. When 
E went to sleep, the sleep spindles 
tral appeared on the opposite side (con- 
E ateral) of the midbrain hemisection: 
a same experiment was perfor i 
e n the addition of a pretrigemima 

lidbontine section, all aspects of the 


= result were repeated pd 
or failure of the sleep spindles ot the 
the 


De; : 
M on the contralateral side O 
idbrain hemisection after 6-8 post- 


ee days. Cordeau and Mancia 
Uggested that these results favored the 
p system 


ee of a synchronizing slee 
"ch exerts its effect behin 
en tine section. The reason 
o ven occurred in the locus 
tio Spindles in the midbrain 
P ned S is likely the result © 
hi. y action of the DEUS ion, 
yeu the synchronizin£ fe as à 
ar the midpontine pretrigemina prep- 
ation, on the other hano the com 
Pensatory switch is not see? postopera” 


iy : € : 
ely, since the mechanism producing 


of onset 
-hemisec- 
a com- 


these spindles was severed from the 
rest of the brain. In order to explain 
why spindles are produced at all in 
the pretrigeminal preparation, it is as- 
sumed that the thalamic spindle pace- 
maker (Morrison & Dempsey, 1943) 
does operate in the absence of the lower 
brainstem synchronizing influence. 
Magni, Moruzzi, Rossi, and Zan- 
chetti (1959) have demonstrated 
arousal reactions by the application of 
the barbiturate thiopental. These 
workers were able to limit the distri- 
bution of the drug to the lower brain- 
stem by ligation procedures which did 
not interrupt the vertebral or carotid 
circulation. Under these conditions 
the application of thiopental produced 
a desynchronized response in the EEG. 
A similar application of the same drug, 
but allowing its distribution throughout 
duced the expected EEG 


the brain, pro 
synchronization. These authors inter- 


reted their results as supporting the 
view that a synchronizing structure is 
located within the lower pons and ros- 
tral medulla. Application of thiopental 
depressed the activity of this structure, 
thereby releasing the pontine and mid- 
prain RF from its inhibitory influence, 
and bringing about a desynchronized 
response. That the lower brainstem 
was, in fact, depressed is revealed by 
of the corneal reflex. These 


the absence 
results confirm those of Cordeau and 
Mancia (1958, 1959) and support the 


notion that there exists a synchroniz- 
ing mechanism in the lower brainstem. 
Hugelin, Bonvallet, and their co- 
workers (Bonvallet & Allen, 1963; 
- Bloch, 1961; Dell, Bon- 
, 1961) have shown 
h cortical and lower 
ffects acting on the mid- 
f Moruzzi and Magoun 
arizing their re- 
rocedure which depressed 
for example, decortica- 
all doses of chloralose, 


that there are bot! 


brainstem € 
F o 
Briefly summ 


tion, cooling, SI 
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hyperventilation, increased the excita- 
bility of RF as evidenced by the greater 
reticular facilitation of the masseteric 
myotatic reflex. From their findings 
they postulated a negative-feedback sys- 
tem in which excitation of RF of the 
midbrain leads to cortical arousal which 
then dampens reticular activity. These 
results may be contrasted with the 
facilitatory positive-feedback system 
which French, Hernández-Peón, and 
Livingston (1955) described. These 
latter workers showed that stimulation 
in cortex produced reticular arousal 
and suggested that the cortex is impor- 
tant in the maintenance of arousal. 
Since it is quite possible that both in- 
hibition and facilitation of RF by 
cortical structures may be present, the 
conflicting results of the Parisian and 
Californian workers may be only ap- 
parent. 

Of more importance to the present 
discussion is the review of the Bon- 
vallet and Bloch (1960) study by Dell 
et al. (1961). These authors showed 
that the interruption of the pathways 
between the RF of the midbrain and 
bulbar structures increased the in- 
tensity and duration of cortical arousal 
produced by afferent nerve stimulation. 
Also, stimulation in RF produced a 
much greater arousal in the prebulbar 
sectioned animal than in the encéphale 
isolé preparation. These data likewise 
lend support to the notion of a syn- 
chronizing system in lower pons and 
rostral medulla. 

It is not clear, however, how this 
synchronization is brought about, Dell 
et al. (1961) consider two possibilities. 
Either this synchronizing mechanism 
exerts its effect on the thalamic syn- 
chronizing system or it exerts its effect 
by deactivating RF. Dell et al. favor 
the latter view, but it is not at all 
necessary to reject the former view in 
the process, since it is quite likely that 
one result of reticular deactivation is 


AnvEH ROUTTENRBERG 


the partial release of the synchronizing 
system of the thalamus from the over- 
powering effects of reticular stimulation 
(Purpura & Shofer, 1963). 

Bonvallet and Allen (1963) have 
performed a series of experiments to 
determine the location of the synchro- 
nizing structure and whether its effect 
is mediated by deactivating RF or by 
activating synchronizing structures. 
These authors have shown that dis- 
crete lesions of the cephalic portion of 
the n. tractus solitarius are followed by 
a general instability of reticular activ- 
ity in sympathetic, parasympathetic, 
motor, and cortical facilitation func- 
tions. These findings suggest a loss 
of control of RF function by the elimi- 
nation of a homeostatic mechanism 
in medulla and strongly implicate 
the n. tractus solitarius as the criti- 
cal synchronizing mechanism. These 
authors also showed by other lesion 
experiments that the effects obtained 
could not be duplicated by destruc- 
tion of the classical bulbar inhibi- 
tory RF described by Magoun and 
Rhines (1946). Thus, while the area 
most clearly implicated in the latter ef- 
fect has been the ventromedial RF, Bon- 
vallet and Allen (1963) found that the 
“synchronizing” structure was located 
more laterally. They have localized the 
effect to the n. tractus solitarius at the 
level of the dorsal motor nucleus of 
the tenth nerve and to those cells lying 
between this area and the floor of the 
fourth ventricle, They have also 
shown that the locus of the synchroniz- 
ing effect is distinct from the nearby 
bulbar region regulating blood pres- 
sure, but may be involved with afferent 
fibers from the cardiovascular system. 

Another example of inhibitory in- 
fluence from lower structures will be 
given, primarily because the conclu- 
Sions are somewhat similar to those 
reported above, but the neurophysio- 
logical technique used is quite different. 


A, 


s 
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Demetrescu and Demetrescu (1962) 
studied the effects of reticular stimula- 
tion on potentials evoked in visual 
cortex by electrical stimulation of the 
lateral geniculate. They found that 
stimulation in dorsal areas of RF en- 
hanced visual evoked potentials, but 
stimulation in ventral pontine RF had 
no effect. If lesions were then made 
in the central gray of the midbrain, 
stimulation of ventral RF inhibited 
the thalamo-cortical evoked potential. 
This study suggests that there is an 
inhibitory system located in ventral 
pons which exerts an effect that is 
Opposite in sign to that shown by the 
dorsal and lateral midbrain tegmentum. 
A recent report by Rossi, Minobe, 
and Candia (1963) has dealt with the 
Problem of identifying the site of origin 
of the ascending influence responsible 
for the EEG patterns of deep sleep. 
They have performed brain hemisec- 
tion experiments similar to those of 
Cordea and Mancia (1958; 1959). 
Rossi et al. (1963) have compared 
EG patterns in cats hemisectioned at 
Various levels of the brainstem during 
Slow and fast sleep. Of particular 1m- 
terest to our discussion on synchrom2- 
ing mechanisms is the finding 
Was no asymmetry in the 
hemisections at the caudal 
rostral medulla, suggesting ie 
synchronizing system is probably o 
cated at the pons-medulla boundary re- 
gion. These authors showed that hemi- 
Section of the midbrain produced js 
Increase in S-SW and a decrease js 
S-LVF. When the hemisectó" was 
Placed behind n. reticularis PE 
oralis, there was an increase i” S-L 
and a decrease in S-SW. PC 
lesion was placed at the level of RP 
se was a decrease of te 
Crease in S-SW. Accordin 
*t al. (1963), then, both slow ms 
Sleep were apparently controlled by 


system located within the pons, and it 
thus appeared that: 


... the integrity of the rostral part of the 
pons is essential for the maintenance of the 
waking state, and the middle and caudal parts 
of the pons play a very important role in 
the occurrence of sleep. A different rostro- 
caudal distribution of the arousing (rostral 
pons) and sleep-inducing (middle and caudal 
pons, and perhaps medulla oblongata) neu- 
rons seem proved [p. 487]. 


These authors stated that there ap- 
pears to be an active and passive origin 
of slow sleep and an active origin of 
fast sleep. In the former case, the pas- 
sive origin refers to the deafferentation 
of midbrain RF at the first stages of 
sleep, while the active origin of slow 
sleep refers to the inhibitory action of 
pontine synchronizing structures on 
pontine and perhaps midbrain RF. 

There are several difficulties in inter- 
preting the Rossi et al. (1963) study. 
The one difficulty that appears most 
outstanding is the possibility that sec- 
tions at certain levels of the brainstem 
may be eliminating both synchronizing 
and desynchronizing systems or syn- 
chronizing systems and systems con- 
cerned with the mechanisms of S-LVF. 
In such cases, it would seem difficult 
to make any definite statements con- 
cerning functional localization. Subse- 
quent studies should be performed us- 
ing discrete stereotaxic lesions. 

Summarizing this section on synchro- 
nizing systems, it would appear certain 
that there are important structures in 
the caudal pons that play a role in 
synchronizing the EEG during sleep. 
Further studies should clarify the rela- 
tion between this brainstem synchro- 

"ung system and the synchronizing 
non d in thalamus (Morrison 
system foun 

1943). 
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made between the state of conscious- 
ness of the organism and brain activity 
recorded presumably from individual 
neurons by means of microelectrodes 
.5-10 microns in diameter at the un- 
insulated tip. From these experiments 
it has been shown that, in general, 
single-unit activity is at a high level 
during wakefulness, at a low level dur- 
ing S-SW, and at a high or higher 
level during S-LVF than during wake- 
fulness. This result has been shown 
with neuronal activity derived from cor- 
tex (Evarts, 1962; Murata & Kameda, 
1963), pyramidal system (Arduini, 
Barlucchi, & Strata, 1963; Evarts, 
1964), and from vestibular nuclei 
(Bizzi, Pompeiano, & Somogyi, 1964). 
Increases in single-unit activity in 
RF during S-LVF have, in fact, been 
demonstrated by Huttenlocher (1961) 
and Brooks and Bizzi (1963). 

It appears quite remarkable that the 
individual elements of the central nerv- 
ous system are as active during wake- 
fulness as during S-LVF. One possi- 
bility suggested by Evarts (1964) is 
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Fic. 1. Possible brainstem organization 
involved in sleep. (White arrows denote 
facilitation, striped arrows, suppression.) 


that the patterning of neuronal activity 
is different during wakefulness than 
during S-LVF. Thus, frequency of 
unit firing would be similar . during 
wakefulness and during S-LVF, but 
the distribution of activity (e.g., inter- 
spike interval, variance) would be dif- 
ferent. In this regard, the analysis of 
thalamic-unit activity by Werner and 
Mountcastle (1963) may perhaps be 
fruitfully applied to the comparison be- 
tween neuronal firing during wakeful- 


ness and neuronal firing during S- 
LVF. 


ORGANIZATION OF RETICULAR SLEEP 
MECHANISMS 


An attempt will be made in this sec- 
tion to bring together the results dis- 
cussed up to this point into a schema 
of brainstem function during sleep. 
The basic system is summarized in 
Figure 1. Certain features of this 
schema should be pointed out. First, 
it is assumed that there are two ana- 
tomically distinct desynchronizing sys- 
tems, which we shall call Arousal Sys- 
tem I and Arousal System II. System 


: l is closely related to the classical 


arousal system originally described by 
Moruzzi and Magoun (1949) and is 
generally thought of as the arousal sys- 
tem. System II is part of the limbic- 
midbrain system described by Nauta 
(1958) in the cat. Although arousal 
as a unitary phenomenon has been 
used fruitfully in neuropsychological 
theory (Hebb, 1955), the present data 
as well as other findings (e.g., Feld- 
man & Waller, 1962) appear to neces- 
Sitate the postulation of two arousal 
Systems. Since neocortical desynchro- 
nization does not differentiate between 
the activity of Systems I and II, it is 
Important to determine whether some 
other measure can differentiate between 
desynchrony produced by System T and 
desynchrony produced by System Il. 
It appears that electrical activity in 
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hippocampus may serve such a need. 
It will be recalled that Green and 
Arduini (1954) showed that stimula- 
tion of RF produced 4-7 cycle/second 
waves in hippocampus. These au- 
thors identified this hippocampal theta 
rhythm with arousal. Recently, how- 
ever, it has been shown that stimulation 
in midbrain RF can also produce LVF 
(desynchronization) in hippocampts 
(Stumpt, 1965). This result suggests 
the possibility that neocortical EEG 
desynchronization may indeed be di- 
vided into two states: one identified 
with hippocampal desynchronization 
and another identified with hippocam- 
pal theta. The former, in the present 
schema, is derived from reticular- 
activating System I, the latter, from 
limbic-midbrain System II (Kawa- 
mura, Nakamura, & Tokizane, 1961). 
A second feature of the schema is 
that hippocampal theta tends to dampen 
the activity of System I. This view 1s 
primarily based on the work of Gras- 
tyan (1959), who has interpreted his 
: results to mean that hippocampal ac; 
tivity represents an inhibition of the 
orienting reflex to novel stimuli. 
third feature of the schema is that a 
positive-feedback loop exists between 
System I and neocortex. Such a proc- 
ess has been suggested by French et al. 
(1955). Based on the above view that 
hippocampal theta dampens the activ- 
ity of System I, and that neocortex and 
System I form a positive-feedback sys- 
tem, the following hypothesis may be 
stated: At low or intermediate levels 
of activity, System II will predominate 
over System I since the suppress 
effect of hippocampal theta will ten e 
diminish the positive-feedback effect be- 
tween System I and neocortex. ^ 
high levels of activation, System I wi 
predominate since the feedback system 
will be in too great à state of activity 
to be suppressed by hippocampal theta. 
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From the present review of the sleep 
literature, there appear to be two criti- 
cal structures in brainstem involved in 
the sleep process. The first structure 
is the synchronizing mechanism which 
has been tentatively identified with the 
rostral aspect of the n. tractus soli- 
tarius. The second structure is the 
S-LVF mechanism which has been 
shown by Jouvet (1962) and others 
(see Hobson, 1965) to be associated 
with n. reticularis pontis caudalis. 
With respect to the synchronizing 
mechanism, the present schema assumes 
that both Arousal I and Arousal II 
are inhibited by this system. Accord- 
ing to the present view, the midpontine 
section. of Batini, Magni, Palestini, 
Rossi, and Zanchetti (1959) separates 
the synchronizing system from Arousal 
I and Arousal II, thus causing a re- 
lease from the inhibition of the syn- 
chronizing system. If it were assumed 
that n. reticularis pontis oralis repre- 
sents a critical component of the posi- 
tive-feedback system, then one might 
understand how a rostropontine section 
might yield significantly less neocortical 
desynchronization. 

It remains finally to account for the 
relation of the S-LVF mechanism 
(RPC) with the system thus far de- 
scribed. The assumption that satisfies 
most of the demands of the data is that 
RPC inhibits the synchronizing mech- 
In such a case, inhibition of 
the synchronizing system would remove 
the inhibition from Arousal I and 
Arousal II and thus permit the latter’s 
activity. This activity would then ap- 
pear as desynchronization in the neo- 
cortical EEG and theta rhythm in hip- 
pocampus. If we assume that RPC 
directly or indirectly inhibits neck- 
muscle EMG, vasoconstrictor tone, 
sudomotor activity, and spinal reflexes, 
then the present schema has achieved a 
representative description of what has 


anism. 
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been called “paradoxical sleep,” that is, 
S-LVF. : : 

Two comments concerning possible 
cortical relations with RPC and possi- 
ble relation of this model to behavior 
in the waking state will be made. Pre- 
sumably, if telencephalon is critical for 
the onset of S-SW (Jouvet, 1962), and 
if RPC is important in S-LVF » it 
would be useful to know what relation, 
if any, exists between them. Newman 
and Wolstencroft (1959) have re- 
corded short-latency responses of 1 
millisecond in medulla resulting from 
stimulation of the orbital gyrus in 
frontal lobe. This result was also 
obtained by these authors using 
the strychnine neuronography method. 
Earlier, Sachs, Brendler, and Fulton 
(1949) had noted strychnine spikes in 
medial forebrain bundle (MFB), and 
Nauta (1962) has recently traced 
fibers from the posterior orbital area 
to MFB. Auer (1956) has also noted 
terminal degeneration in MFB follow- 
ing frontal cortex ablation. No de- 
generation was seen in the ventrome- 
dial nucleus of the hypothalamus or in 
mammillary nuclei. "These studies sug- 
gest that the orbital gyrus has a strong 
relation to the limbic-midbrain System 
described by Nauta (1958). It will be 
recalled that this circuit was implicated 
strongly by Jouvet as important in 
S-LVF. " 

The second comment concerns possi- 
ble use of the schema in the waking 
state. It should be pointed out, first, 
that the process of sleep seems organ- 
ized on a hierarchical basis. Consider- 
ing four states, aroused, relaxed, S- 
SW, and S-LVF, transitions from one 
stage to another by proper stimulation 
can be demonstrated. Thus, several 
workers have been able to produce 
relaxation and then S-SW with low- 
frequency stimulation (Pompeiano & 
Swett, 1962; see Rossi & Zanchetti, 
1957, for review) or with chemical 
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stimulation ( Hernández-Peón, Chavez- 
Ibarra, Morgane & Timo-Iara, 1963). 
The reverse operations have also been 
shown by Lissak, Karmos, and Grast- 
yan (1963) ; they were able to produce 
sleep with slow waves following S- 
LVF by stimulation in hippocampus. 
The tentative conclusion may be made 
that in the intact and properly func- 
tioning organism, there is a successive 
reduction in the locus of control of the 
conscious state. As Jouvet has pointed 
out, S-LVF is always preceded by 
S-SW. Thus it would appear that the 
difference between the sleep and wak- 
ing state may be a function of the 
System which is in primary control and 
in the overall pattern of nervous activ- 
ity in central and peripheral structures, 
rather than total overall activity alone. 
This means that any single measure 
will probably never describe the state 
of consciousness of the organism. It 
would appear that at least two meas- 
ures will be required. 

Having discussed the problems with 
inferring consciousness from a single 
measure, it now may be fruitful to con- 
sider the possibility that the synchro- 
nizing and desynchronizing mecha- 
nisms seen during sleep may be 
operating during wakefulness. For ex- 
ample, it has been shown by Clemente, 
Sterman, and Wyrwicka (1964) that 
immediately following reinforcement 
the EEG of the Cat is synchronized. 
This pattern of Synchrony in the awake 
animal may have its derivation from 
either thalamic or brainstem synchro- 
nizing structures or both. We have 
no data on this point. Since it has 
been shown that both synchrony and 
desynchrony appear in both sleep and 
wakefulness, perhaps our understand- 
ing of the neural mechanisms of the 
sleep process will allow for insights 


into the neural mechanisms of waking 
behavior as well, 


c 
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CERTAIN CHANGING VIEWS OF 
RF Function 


, Consideration of the neural mecha- 
nisms underlying sleep suggests that 
there may be a necessity to modify 
Current views of RF function. 

l. The brainstem organization im- 
portant in sleep and wakefulness is 
extremely complex. The proposed 
Schema represents only the merest sug- 
Sestion of the complexity that is in- 
volved. The notion that sleep results 
from midbrain RF inactivity is at best 
a simplification, and at worst, incor- 
Tect (Moruzzi, 1963b). Whatever the 
Correct organization of the system may 
be, it is clear that the view of the 
Single, unified RF that is either active 
(wakefulness) or inactive (sleep) is 
inadequate to account for the current 
data on sleep. 

2. LVF (desynchronization) seen on 
an EEG may represent high arousal, 
Moderate arousal, or deep sleep. 
Clearly this desynchronization should 
Not be termed an “arousal pattern 
in the absence of other correlated 
Measures, 

3. While a major emphasis has been 
Placed on the midbrain RF as the criti- 
cal structure in mediating arousal, the 
Work of Moruzzi, Rossi, and their co- 
Workers suggests that the pontine RE 
Plays at least as important a role in 


Ree and wakefulness as does midbrain 


4. Feldman and Waller (1962) 
have shown that certain lesions of the 
Tainstem caused an animal to be 
asleep or drowsy while producing a 
desynchronized EEG; other lesions 
rused a synchronized EEG pattern, 

ut the animal was quite awake: i 
Would appear that present concepts 0 

" function are inadequate to account 
Or such a result. The present schema 
Might be able to do so if it were 25. 
Sumed that one lesion was in Arousal 
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I while the other lesion was in Arousal 
II. Similar problems in interpretation 
occur with the work of Wikler (1952), 
who showed that atropine causes syn- 
chronized EEG patterns but leaves the 
animal awake. In this case, also, it 
may be possible that atropine differen- 
tially affects the two arousal systems 
so that wakefulness is maintained by 
one, while synchrony is brought about 
by the atropine-caused depression of 
the other. 

5. Finally, it may be pointed out that 
several traditional experimental find- 
ings and interpretations have been 
questioned by recent research. Thus, 
Schlag and Chaillet (1962) have 
shown that the diffuse thalamic projec- 
tion system does not directly cause neo- 
cortical desynchronization, but rather 
activates cortex by a descending route 
through brainstem RF. Baust, Nie- 
meczyk, and Vieth ( 1963) and Baust 
and Niemeczyk (1963, 1964) have 
shown quite conclusively that the adre- 
nergic arousal reported by Rothballer 
(1956) was a secondary effect caused 
by a primary response of an increase in 
blood pressure. Although by no means 
definite, Morillo and Baylor (1963) 
found no evidence that medial lemnis- 
cus projects into the brainstem RF. 
Such a result is opposite to the findings 
of Starzl et al. (1951). More work 
is required to resolve this issue. 
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Findings indicate that under most conditions nonrapid eye move- 
ment sleep (NREMS) represents an indispensable condition for the 


onset of rapid eye movement sleep 


sequential relationship are explored. 
ing NREM sleep involves a progre: 


which, nevertheless, 
appropriate limits. A homeostati 
then postulated. This invol 
tendency toward deepening N 
when it reaches a preset level, 
toward REMS and organism 
ing to increase cortical "tonus" 
afferentation.” 


Recent research has produced a 
rapidly expanding body of knowledge 
concerning two qualitatively distinct 
types of sleep: rapid eye movement 
sleep (REMS) and the quiescent sleep 
Stages referred to collectively as non- 
rapid eye movement sleep (NREMS). 
In spite of much that has been learned 
in the past several years about the 
characteristics of REMS and NREMS, 
however, the physiological function of 
REMS is not yet known, and to date 
little attention has been paid to pos- 
sible implications of the sequential re- 
lationship between these two forms of 
sleep; or to the significance of their 
markedly different—often contrasting 
—natures, for an understanding of the 
function of REMS. 

Should REMS be looked upon as a 
regressive state which atavistically in. 


1 A shorter version of this paper was read 
before the Association for the Psychophysi- 
ological Study of Sleep, Washington, D. ica 
March 26-28, 1965. The authors would like 
to express appreciation to Howard Roffwarg 
and Frederick Snyder for their critical read- 
ing of an earlier draft of this manuscript and 
to Wilse Webb for his helpful comments 
during preparation of the final version. 


(REMS). The implications of this 
It is conceptualized that deepen- 


ssive loss of cerebral vigilance 


must somehow be maintained within adaptively 


c interplay between sleep phases is 


ves 2 complementary tendencies; 1 
REM sleep, and organismic rest, which, 
triggers the release of another tendency 
ic activation. 


REMS is seen as act- 
through a process of "endogenous 


trudes during sleep, or does it serve 
Some current role in the economy of 
the organism? Does REMS operate 
on its own time clock independent of 
NREMS (which precedes it under 
most normal conditions) or may the 
Cyclic alternation of these two sleep 
forms reflect some homeostatic process 
at work? 

In this paper we propose to explore 
the relationship between the sleep 
phases, giving particular attention to 
evidence which suggests the presence of 
a homeostatic process underlying the 
NREMS-REMS cycle. Following a 
brief description of research and an ex- 
amination of some other views on the 
function of REMS, we will outline a 
theory of the function of this sleep 
state. Implications of this concept for 
further research will þe suggested. 


RESEARCH AND THEORY 


The NREMS-REMS Cycle 


Since several comprehensive reviews 
of the literature are available (Dement, 
1964; Dement, 1965a; Fisher, 1965: 
Hartmann, 1965; Snyder, 1965a. 
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1965b) we will recount only a few find- 
ings. By now it is well known that in 
1953 Aserinsky and Kleitman noticed 
that periods of highly active (REM) 
sleep appeared to alternate with periods 
of quiet (NREM) sleep in a regular, 
cyclical manner, and that the activated 
sleep phase was accompanied by rapid 
conjugate eye movements which occur- 
red in bursts (Aserinsky & Kleitman, 
1953). It has subsequently been de- 
termined that this rapid eye movement 
state is present in human beings of all 
ages (Aserinsky & Kleitman, 1953, 
1955) as well as in all mammalian 
forms studied to date (Klein, 1963; 
Snyder, 1965a, 1965b). In a typi- 
cal night’s sleep (in the human adult) 
REMS begins a little more than 
an hour after sleep onset, recurs 
roughly every 90 minutes throughout 
the night, and is accompanied by 
marked changes in electrical record- 
ings, corresponding to a cyclic vari- 
ation in sleep stages as measured by 
the electroencephalogram (EEG) (De- 
ment & Kleitman, 1957a, 1957b). 
Commonly, five stages of EEG sleep 
are identified. At sleep onset, 2 brief 
period of Stage 1 occurs, characterized 
by low voltage fast waves. This soon 
gives way to Stage 2, which is marked 
by the appearance of synchronous 
bursts of 14-cycles per second activity 
known as “sleep spindles” and/or by 
‘K Complexes.” Sleep then usually 
proceeds to Stages 3 and 4 C deep 
NREMS) which are characterized by 
slow, high voltage potentials. After 
some time in these "deeper" Stage» 
there is a return to Stage 1, which 1S 
how for the first time accompanied by 
rapid eye movements and other signs o 
intense activation which have sive? rise 
to its separate designation 2$ Stage 


1-REM. Thus in normal subjects 
(Ss), REMS regularly follows 
NREMS. This sequence of sleep 


stages is then repeated throughout the 
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night with minor variations (Dement 
& Kleitman, 1957a). 

Many distinct physiological changes 
are seen during the rapid eye move- 
ment state. While these are too nu- 
merous to list in their entirety, they 
include acceleration and irregularity of 
heart-rate (Snyder, Hobson, Morrison, 
& Goldírank, 1964); increase in and 
variability of systolic blood pressure 
(Snyder et al., 1964) ; shallow, rapid 
and irregular breathing (Snyder et al., 
1964) ; full or partial penile erections 
(in primates) (Fisher, Gross, & Zuch, 
1965; Gross, Weitzman, Fisher, & 
Byrne, 1966; Karacan & Snyder, 
1966); and increased brain tempera- 
ture in cats (Rechtschaffen, Cornwell, 
& Zimmerman, 1965) and rabbits 
(Kawamura & Sawyer, 1965). Neural 
changes suggesting marked activation 
of sensory receiving and motor areas 
of the brain also occur and will be 
discussed later. Paradoxically, in con- 
trast to numerous indications of in- 
creased neural and metabolic activity, 
resting muscle tone from head and neck 
virtually disappears during REMS 
(Berger, 1961; Jouvet & Michel, 
19602) and there is a sharp attenu- 
ation of spinal reflexes at this time 
(Hodes & Dement, 1964). 

Some findings indicate that REMS 
may represent a necessary physiologi- 
cal state since both human Ss (Dement, 
1960) and animals (Dement, 1965b; 
Touvet, 1965a; Khazan & Sawyer, 
1963) systematically deprived of 
REMS (by being awakened shortly 
after onset of the rapid eye move- 
ments) show increased REMS on sub- 


sequent nights, time spent in this sleep 
stage on "recovery nights (or number 
of awakenings required to maintain 
REMS deprivation) increasing in 
nearly linear fashion with. the number 

f such deprivation. 


of nights o " 
REMS appears related to dreaming 


in humans in such manner that dreams 
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are reported in approximately 80% of 
awakenings from it (Dement & Kleit- 
man, 1957a, 1957b). While investi- 
gators have obtained reports of mental 
content including dreams or “dream- 
like" experiences from all phases of 
sleep (Foulkes, 1962; Goodenough, 
Shapiro, Holden, & Steinschriber, 1959; 
Kamiya, 1961; Orlinsky, 1962; Recht- 
schaffen, Verdone, & Wheaton, 1963), 
if one were to concentrate solely on the 
type of dream which includes highly 
organized, imagery-laden plots or 
stories with relatively more reference 
to strong emotion, rapid eye movement 
periods appear predictive of this sort 
of dream. 


Theories Concerning the Function of 
REMS 


Most theories to date on the func- 
tion of REMS have been expressed 
briefly by authors reporting research, 
Speculations include Suggestions that 
REMS may act as à "rhythmic energy- 
feeding mechanism" Serving to main- 
tain the organism through the long fast 
of sleep (Mandell & Mandell, 1965) ; 
REMS may be the expression of a 
periodic function of storing informa- 
tion at the molecular level and thus 
play an essential role in memory 
(Jouvet, 19652) ; and REMS in pre- 
matures, newborns, and infants may 
correspond to a unique and ontogeneti- 
cally primitive state of vigilance ( Petre- 
Quadans, in press). É 

Although these and similar theories 
are provocative, the tentative form of 
their presentation has prompted us to 
confine our discussion to the more de- 
tailed hypothesis recently put forward 
by Roffwarg, Muzio, and Dement 
(1966). These authors have suggested 
that a functional consequence of REMS 
may be to assist structural maturation 
of the central nervous system ( CNS) 
through supplying great quantities of 
endogenous stimulation to the higher 
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centers at a time (in utero and dur- 
ing early postnatal life) when brain 
growth is maximum. They present 
arguments in support of the position 
that maturation is often dependent on 
function and point out that the ex- 
tremely high percentage of REM sleep 
seen at birth falls off with increasing 
age and that, interestingly, it is at this 
time that the CNS is receiving corre- 
spondingly longer periods of waking 
stimulation. 

The developmental theory of REMS 
proposed by Roffwarg, Muzio, and 
Dement has much to commend it and 
offers, in our opinion, the best explana- 
tion so far advanced for the much 
higher percentage of REMS seen in 
early life. This theory appears in- 
complete, however, in that it does not 
explain the continued occurrence of 
REMS throughout the life cycle. This 
limitation is noted by the authors, who 
suggest that REMS may fulfill some 
as yet unknown function of CNS stim- 
ulation even after maturation is com- 
pleted. In the present paper, an at- 
tempt will be made to explore the 
function of REMS within the perspec- 
tive of the organism’s total life span. 


The Concept of Cortical Tonus 


Head (1923) used the term “vigi- 
lance” to describe a state of “high-grade 


neurophysiological efficiency” which 
varies with the stat 


ment, the activity of a particular por- 
tion of the CNS exhibits signs of 
Integration and purposive adaptation. 

i ems directly appli- 
f sleep, some of its 


distinguished 
While the latter 
i$ inversely proportional to the strength 
itati d to produce any 
activity (that is, 
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to threshold value), the main factor in 
vigilance is the fundamentally diver- 
gent nature of the response. This 
would vary according to whether the 
organism is (at the time of stimulus 
presentation) in a high or low state of 
vigilance. A high state of vigilance 
1s said to tend to evoke more or less 
high-grade, adapted responses. 

_ , While Head’s concept appears prom- 
Ising for a comparative study of the 
Various sleep stages, the term “vigi- 
lance" is commonly used to refer to a 
State of watchfulness with respect to 
danger or hazard. It is also employed 
in an experimental sense, to denote 
the degree of responsiveness to external 
Stimuli found in the sleeping organ- 
ism. It would therefore be desirable 
to find a different term to denote vari- 
ations in the general level of efficiency 
or "tone" of the CNS. 

In 1938, Bremer referred to “corti- 
cal tonus," a condition characterized by 
Spontaneous electrical activity of the 
higher brain centers which was said to 
be maintained by corticopetal (affer- 
ent) impulses and thus to be automati- 
cally lowered with surgical or func- 
tional deafferentation. While it is not 
quite clear whether Bremer intende 
the term "cortical tonus" to refer to 
heurophysiological efficiency as well as 
Neuronal activation, the term appears 4 
Useful one, and will be clarified for use 
In the present discussion. 

e employ “cortical tonus” to de- 
Note a level of tonic CNS excitation 
Suitable for efficient processing of a 
formation, limiting the term MBE: 
lance” to denote responsiveness to = 
ternal stimuli only. While cortica 
tonus and vigilance so defined will of- 
ten covary, they need not always do 
So. The organism could be said to bt 
1n a state of heightened cortical tonus, 


debt o 

Or example, when it 1S ie pud 
to stimuli which are entirely endog 

when 


Sus in nature, as, for example, 
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organismically-generated afferent im- 
pulses or memory traces command at- 
tention. Since attentiveness to “en- 
dogenous afferentation” might, in this 
case, even impede responsiveness to 
external stimuli, heightened cortical 
tonus would then result in lowered 
vigilance. 

While a certain level of neuronal 
excitation must be present for adequate 
cortical tonus, supernormal CNS ex- 
citement such as occurs when body 
temperature is experimentally raised 
or an excess of oxygen is administered 
can lead to hyperirritability, loss of 
concentration and other symptoms of 
lowered CNS efficiency (Barcroft, 
1938). Accordingly, to establish the 
presence of adequate cortical tonus, we 
should find evidence of a heightened 
capacity for effective processing of in- 
formation, as well as indications of 
neuronal activation. In evaluating evi- 
dence for a homeostatic theory of 
REMS it therefore seems desirable to 
consider both behavioral and neuro- 
physiological data. In certain in- 
stances, however, data will be available 
in only the one or the other of these 
areas, thus limiting discussion to a 
single type of evidence. 


Cortical Homeostasis 

At the commencement of sleep, with 
descent into NREMS, cortical func- 
tioning is profoundly altered as the de- 
synchronized neuronal activity of wak- 
ing life gives way to synchrony. As 
sleep deepens there is à progressive 


ilance, although the organism 


loss of vigi : 
does not lose the capacity to respond to 


urgent stimuli (Coleman, Gray, & 
Watanabe, 1959; Fischgold, Schwarz, 
& Dreyfus-Brisac, 1959; Oswald, Tay- 
lor, & Treisman, 1960). Since vigi- 
lance is not entirely lost in normal 
sleep (presumably because of the high 
survival value attached to the capacity 
to respond while asleep to certain vital 
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challenges) it could be said that sleep 
is contained within adaptively appro- 
priate limits. 

While regulation of depths of sleep 
may seem self-evident, this is possibly 
only because we take for granted cer- 
tain mechanisms which control the 
level of cerebral excitation so effec- 
tively that a sufficient degree of corti- 
cal tonus is regularly maintained. It 
might be useful to explore this ques- 
tion and search for means by which 
such tonus may be insured in sleep, 
As Cannon (1929) points out: 


in an open system such as our bodies repre- 
sent, compounded of unstable material and 
subjected to continually disturbing condi- 
tions, constancy is in itself evidence that 
agencies are acting, or ready to act to main- 
tain this constancy [p. 424]. . . . When a 
factor is known which can shift a homeo- 
static state in one direction it is reason- 
able to look for automatic control of that 
factor or for a factor or factors having an 
opposing effect [p. 426]. 


Cannon was primarily concerned 
with mechanisms (neural, endocrine, 
and others) which perform autonomic 
regulations necessary for normal bodily 
function and health. More recently, 
this classical concept of homeostasis 
has been extended by Davis (1950) 
and Gellhorn (1953) to include regu- 
lation of the CNS. Davis (1950) sug- 
gests that the regulation of the physio- 
logical state which normally varies di- 
urnally between sleep and waking (and 
which extends, when homeostasis is 
inadequate, to coma at the one ex. 
treme and to convulsions at the other) 
may be looked upon as representing a 
homeostasis of cerebral excitability, 
Such a process would imply a dual 
function of the CNS: one part or sub- 
system performing the tasks usually 
associated with the nervous system, 
and the other (probably anatomically 
coextensive with the first) devoted to 
maintaining the proper level of CNS 
excitability or “tone.” Davis does not 
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see this regulatory mechanism within 
the nervous system as comprising the 
reticulo-cortical system or any other 
specific tracts, but suggests that a gen- 
eral system plus several subsystems 1s 
most likely involved. Since negative 
feedback would be required, his theory 
implies the equivalent of intracortical 
sense organs, themselves sensitive to 
the level of excitability of cortical neu- 
tons. We will later have occasion to 
refer to the possibility of such an af- 
ferent system within the cortex. 
Gellhorn (1953) similarly proposes 
that there are present in the brain phys- 
iological adjustment reactions which 
operate to maintain the degree of corti- 
cal excitation within fairly narrow 
limits, Reasoning from a somewhat 
different point of view, he cites MIS 
dence suggesting that with excitation 
of the cortex, an antagonistic mecha- 
nism is automatically called into action 
which reduces excessive activation ; and 
that, conversely, any condition leading 
to a decrease in cortical functions auto- 
matically calls forth a release of the 
brainstem from inhibition, which in 
turn leads to a return of cortical re- 
activity, Where surgical or functional 
decortication occurs, the state of the or- 
ganism may pass from coma to con- 
vulsions as soon as the release of brain- 
Stem functions has progressed far 
enough, provided no narcotics are used 
which depress the brainstem, In light 
of this process, which Gellhorn terms 
cortical homeostasis, the convulsions 
are said to represent an ultimate adjust- 
ment mechanism (although of a patho- 
logical form) to counteract the coma 
through discharges from the brain- 
stem and particularly from the hypo- 
thalamus. Sometimes the improve- 
ment of cortical functions seen as 2 
result of a spontaneous or induced con- 
vulsive seizure is marked and S ap- 
pears normal for some time, offering, 
according to Gellhorn, an impressive 
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illustration of the homeostatic value of 
the seizure under certain conditions. 

Applying the concept of cortical 
homeostasis ? to the problem of the 
maintenance of cerebral excitation dur- 
ing sleep, we have conceptualized a 
homeostatic interplay between N REMS 
(characterized by quiescence and loss 
of cortical tonus), and REMS (char- 
acterized by excitation and heightened 
cortical tonus). To explore the im- 
Plications of this concept, we will com- 
mence with discussion of the NREM 
phase, 


Ture NREM PHASE 
Functional Deafferentation in NREMS 


Oswald (1962) has described sleep 
as “a recurrent, healthy condition of 
inertia and unresponsiveness” and Ma- 
goun (1963) has pointed out that the 
Sleeping state is characterized by the 
absence of perception. Similarly, Bre- 
mer (1935, 1938), Kleitman (1963). 
and Broughton (1966) have referred 
to sleep as causing «functional deaf- 
ferentation.” 

The degree of deaff 
sleep appears to vary wit 
Experimentation indicates that as 
NREMS proceeds from Stage 1 to- 
ward Stages 3 and 4, various forms 
of response may disappear consect- 
tively, with the greatest impairment 
being found in Stage 4. With progres- 
sion into the deeper NREM sleep 
stages, reaction times for simple man- 
ual responses have been shown to 
Crease (Coleman et al., 1959; Fisch- 
gold et al., 1959) ; EEG responses j| 
auditory stimuli to decrease, (Zung * 
Wilson, 1961) ; and discriminative €a- 
Pacity to become markedly lowered, OF 
to disappear entirely 25 5 Teak 
Stage 4 (Oswald et al., 1960). 


2 ; . +. term incor- 
ü As used in this paper, [m aeostasis © 
erates Davis concept of “hor 
*rebra] excitability,” as We% 


erentation in 
h sleep stage- 
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Despite decreased responsiveness in 
the “deeper” sleep stages, however, 
evidence suggests that considerable de- 
afferentation is experienced even in 
Stages 1 (descending) and 2. When 
Rechtschaffen and Foulkes (1965) 
taped open the eyes of sleeping 5s and 
illuminated various objects held di- 
rectly in front of them, they observed 
that in two of the three Ss studied 
"relative functional blindness" seemed 
to set in even before the onset of Stage 
1. These investigators found no evi- 
dence of incorporation of stimuli into 
subsequent verbal reports from any of 
the sleep stages, and had the impres- 
sion that functional blindness com- 
menced during the phase of EEG alpha 
rhythm and slow conjugate eye move- 
ments which usually precede unam- 
biguous sleep. This suggests that 
functional deafferentation, at least in 
the visual sphere, may be virtually total 
at sleep onset. 

Similarly, while Ss are less respon- 
sive to auditory stimuli during sleep 
Stages 3-4 than they are during Stage 
2, male Ss studied by Zung and Wil- 
son (1961) were unresponsive to au- 
ditory stimuli in at least half of the 
trials administered during Stage 2, with 
female Ss unresponsive to such stimuli 
in about 25-30% of the trials during 
this stage (Zung & Wilson, 1966). It 
seems that the differences which exist 
between Stages 2, 3, and 4 sleep with 
respect to the degree of deafferentation 
experienced may be considerably less 
impressive than the dramatic contrast 
between sleep and the waking state 
with respect to this variable. We 
will therefore refer to NREMS, col- 
lectively, a$ «deafferenting sleep. 


A paradoxical aspect of loss of vigi- 
at certain evoked 


in sleep is th 
lance in tical level may be 


atials at the cortice j 
enna as the individual sinks into 
the deeper NREM sleep stages (Fa- 


vale, Loeb, Manfredi, & Sacco, 1965 ; 
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Herz, 1965). Cortical excitability in 
slow-wave sleep thus appears similar 
to that seen under general anesthesia, 
where evoked potentials are sometimes 
displayed with unusual clarity. Such 
phenomena tend to occur, in fact, only 
in states where the brain has under- 
gone profound modification of func- 
tions so that discriminatory responses 
to external stimuli are sharply reduced 
or absent. 

Because these clearly evoked re- 
sponses occur at times of relative un- 
consciousness, it has been assumed that 
the arousal of the primary neural vol- 
ley (ie. the arrival of an “incoming 
signal”) at cortex from a stimulus is 
not sufficient to produce perception. It 
seems, rather, that more (or different) 
neuronal activity must be present to 
evoke awareness of the stimulus—ap- 
parently an evoked response must be 
"processed" by the brain in order to be 
effective as communication (Dustman 
& Beck, 1965). Certain evoked corti- 
cal potentials in sleep might thus be 
likened to a telephone ringing in a 
room with no one picking it up. The 
"signal" may enter the cortex in a 
crude fashion during sleep, but precise 
information is not conveyed because 
structures capable of processing this in- 
formation appear to be unresponsive at 
this time. 

In this paper such incompleted af- 
ferentation during sleep is referred to 
as “functional deafferentation.” The 
result of such “deafferentation,” as 
Magoun (1963) has pointed out, is a 
profound alteration in the interaction 
between the organism and the external 
world. This in turn could be expected 
to markedly lessen the organism’s 
adaptive capacities. 


Mental Operations during Sleep 


The question of the effectiveness of 
cortical functioning during sleep ap- 
pears central to our discussion, since 
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cortical homeostasis presumably oper- 
ates to maintain the efficiency of the 
higher brain centers. Hebb (1949) 
has suggested that loss of sensory in- 
put in sleep causes disruption of “phase 
sequences” (unified neural processes 
built up through learning, which result 
in organized thought) and some ex- 
perimental studies of artificially in- 
duced sensory and/or perceptual depri- 
vation appear to support the notion 
that operation of phase sequences de- 
pends upon a steady stream of sensory 
input. After some time has passed in 
the isolation chamber, with external 
stimulation sharply restricted, nearly 
all Ss report a lack of clarity in think- 
ing, with difficulty in directing their 
thoughts (Bexton, Heron, & Scott, 
1954; Lilly, 1956). 

Assuming that the quality of menta- 
tion may be indicative of the degree 
to which phase sequence, or organ- 
ized thought, is operating at any given 
time, we might consider the nature 
of mental content reported on awaken- 
ings from both NREMS and REMS 
in an effort to shed further light on the 
relative effectiveness of cortical func- 
tioning in the two sleep phases. 

As indicated, experimenters are in 
substantial agreement that Ss recall 
mental content from all stages of sleep 
(Dement & Kleitman, 1957b; Good- 
enough et al., 1959; Orlinsky, 1962). 
Indeed, despite numerous changes 
found in brain functions during 
NREMS, in about 20% of awakenings 
from this sleep phase, Ss report that 
they have been “awake” and “think- 
ing” about matters of interest to them. 
Does this mean that these Ss are effec- 
tively oriented toward reality at this 
time? Obviously this finding calls for 
Careful evaluation since it is difficult 
to conceive of a neurophysiological 
mechanism by means of which ordi- 
nary secondary process thought could 
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be maintained during a state of mark- 
edly lowered vigilance. 

. The occurrence of integrated think- 
ing during NREMS seems ques- 
üonable in light of recent evidence 
which suggests that NREM sleep may 
Impede the consolidation of memory 
traces (Portnoff, Bakeland, Good- 
enough, Karacan, & Shapiro, 1966). 
Even for moderately organized think- 
ing, short-term memory is essential, 
and it may therefore be significant that 
à vague, wandering, fragmented quality 
is frequently attributed to “thoughts” 
reported on awakenings from NREMS 
(Foulkes, 1962, 1964; Kamiya, 1961; 
Rechtschaffen, Verdone, & Wheaton, 
1963). 

. While at present there are no en- 
tirely satisfactory answers to the ques- 
tion of whether or not "thinking" oc- 
curs during NREMS, the possibility 
Should be considered that some, per- 
haps even the majority, of these think- 
Ing reports actually refer to mental 
Operations occurring during the transi- 
tional period between sleep and waking. 
Support for such a hypothesis may 
be found in the experimental work of 
Goodenough et al, who have shown 
that thinking reports are more likely 
to occur following gradual awakenings 
from NREMS than following abrupt 
awakenings from this sleep phase 
(Goodenough, Lewis, Shapiro, Jaret, 
& Sleser, 1965). 

While these data suggest that some 
NREM thinking reports may bart 
Ports of awakening (hypnopompic) 
experiences, NREM dreams p up 
as often on abrupt as on gradual cdd 
tnings, It seems more likely, there- 
fore, that these dream reports reflect 
Some form of mental activity actually 


Occurring during NREMS. ts = rud 
. epor 
(1962) obtained dream pon dreams 


of NREM awakenings. 1 
are, by definition, perceptual experi- 
ences, and we might wonder if they 
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reflect some “need” for sensory stimu- 
lation or “input” during sleep. Such 
a possibility will be considered when 
we discuss the more full-blown, vivid 
dreams associated with the REM 
phase. 

Data which might shed additional 
light on NREM thinking reports are 
the brief periodic lightenings of the 
EEG pattern, commonly noted through- 
out NREMS (Kleitman, 1963), sug- 
gesting fleeting arousals followed by 
rapid return to sleep. It seems plaus- 
ible to assume that at these moments 
of EEG arousal some form of thinking 
may take place, with those thoughts 
remembered by S when he is awakened 
in the laboratory. If during the inter- 
vening period of NREMS the sleeper 
had been operating in a diffuse, disor- 
ganized mental stage, he might have 
no way of estimating just when these 
thoughts had occurred. It might 
therefore seem to him that he had been 
thinking just prior to being awakened.? 
Taking the above possibilities into ac- 
count, we would interpret subjective 
reports of secondary process thinking 
during NREMS with caution at this 
point. 

In between 2696 (Foulkes, 1962) 
and 54% (Kamiya, 1962) of awaken- 
ings from NREMS, Ss seem unable to 
recall any mental content whatever. 
This suggests the possibility that peri- 
ods of total unconsciousness take place 
during NREMS; or, alternatively, that 
mentation of a nonverbal, nonpercep- 


tual nature occurs which cannot be 


recalled. 
Some observations by Gastaut and 


Broughton (1965) on behavior fol- 


3 Cf. James (1890) and Head (1923) for 
sion of the tendency to preserve an il- 
mental continuity despite actual 
gaps in the stream of consciousness. Such 
illusions of continuity may take place follow- 
ing petit-mal seizures, for instance, or fol- 


lowing fast-acting anesthetics. 


discus 
lusion of 1 
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lowing abrupt awakenings from 
NREMS shed further light on the 
level of cortical efficiency in this sleep 
phase. These investigators found that 
children awakened from Stages 3 
and 4 sleep (usually only with great dif- 
ficulty) showed extreme confusion, 
were amnesic with respect to the awak- 
ening experience, and displayed behav- 
ioral evidence of continued sleep even 
though they showed awake EEG pat- 
terns. The Ss could answer questions 
only after intense stimulation and then 
in a weak voice and with simple abrupt 
answers. Such confusional reactions 
were never present during awakenings 
from initial Stage 1, from Stage 2, or 
from REMS; in fact, it is reported 
that 5s awakened abruptly from REMS 
became immediately alert (Broughton, 
1966). The confusional reactions re. 
ported in one-half of the normal chil- 
dren awakened from Stage 4 are con- 
sistent with the hypothesis that a 
breakdown of organized thought proc- 
esses occurs in NREMS. 

Although mental content which is 
remembered on awakenings from 
NREMS appears to be “drifting,” “dis- 
organized,” and “nondirected” (Recht- 
schaffen, Verdone, & Wheaton, 1963), 
mental content recalled from REMS is 
described as being vivid and elaborately 
organized (Foulkes, 1962; Kamiya, 
1961;  Rechtschaffen, Verdone, & 
Wheaton, 1963). Foulkes thus char- 
acterizes the transformation from 
NREM to REM sleep as progressing 
“from kaleidoscopic flux (NREMS) 
to continuity and integration (REMS)” 
(Foulkes, 1964) with a shift “from 
fragmentary and disjointed content to 
elaborated and integrated content" 
(Foulkes, 1962). r . 

A novel approach to studying shifts 
in mentation between the two sleep 
phases is reported by Fiss, Klein, and 
Bokert (1966). Comparing TAT 
stories told on awakenings from REMS 
with those told on awakenings from 
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NREMS, these investigators found 
that while blocking and relative paucity 
of ideation characterized the stories 
from NREMS (in 40% of such awak- 
enings Ss declaring, "I can't think") 
such blocking never occurred aíter any 
REMS awakening. By contrast, sto- 
ries obtained after interrupted REM 
periods tended to be vivid, imagina- 
tive, bizarre, and emotional, with REM- 
awakened Ss showing faster reaction 
times, 

Along similar lines, Mandell, Man- 
dell, and Jacobson (1965) report evi- 
dence of special mental alertness dur- 
ing REMS, in contrast to NREMS. 
When these investigators read digit 
series to sleeping Ss, asking them to 
repeat the numbers upon being awak- 
ened 5 seconds later, Ss were found to 
have superiority of recall for digits 
Presented during REMS, while they 
failed to recall any of the digits pre- 
sented during NREMS. In like man- 
ner, Cobb et al. have shown that Ss 
can give accurate behavioral responses 
to complex meaningful suggestions ad- 
ministered during Stage-1-REM, but 
not to suggestions administered during 
NREMS (Cobb, Evans, Gustafson, 
O'Connell, Orne, & Shor, 1965). 


Sleep Disturbances in NREMS 


We have cited evidence which sug- 
gests that descent into NREMS is ac- 
companied by progressive disorganiza- 
tion of the mental processes which 
ordinarily serve adaptation in the wak- 
ing state. Since loss of vigilance pre- 
sumably threatens any organism that 
depends on cortical alertness and per- 
ceptual orientation for survival, we 
might wonder whether anxiety oF 
Stress reactions are found in the deeper 
NREM sleep stages. 

Homeostatic mechanisms are nor- 
mally anticipatory in nature, forestall- 
ing pathologic conditions and thus 
largely obviating the necessity for 
Stress reactions, but such responses are 
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not always prevented. Perhaps differ- 
ent persons vary in the degree to which 
they can “risk” loss of awareness in 
NREM sleep, just as in waking life 
we find marked interindividual dif- 
ferences in tolerance to extreme sensory 
deprivation. 

It may be noteworthy that physio- 
logical states suggestive of stress occur 
In certain Ss concomitantly with lower- 
ing of vigilance in NREMS. Possible 
stress reactions noted at this time in- 
clude episodes of sleepwalking which 
have been shown to commence exclu- 
sively in NREMS (usually in Stage 
4) (Gastaut, Batini, Broughton, Fressy, 
Roger, & Tassinari, 1964; Jacobson, 
Kales, Paulson, Lehmann, & Zweizig, 
1965a, 1965b) and which are fre- 
quently accompanied by signs of fear 
Which according to Jacobson et al. 
(19653) may vary from mild anxiety 
to outright terror on the part of the 
somnambulistic S (and during which 
Ss frequently show increased or even 
doubled heart rate [Gastaut & Br ough- 
ton, 1965]).* Also suggestive of stress 
are the sleep terrors of children ob- 
Served to occur exclusively in Stage 
Sleep (Gastaut & Broughton, 1965) ; 
episodes of enuresis which have been 
shown to occur almost exclusively ™ 
NREMS, often in Stage 4 in the 1 
Minutes preceding REMS (Broughton 
& Gastaut, 1963; Hawkins, Scott, & 
Thrasher, 1965) ; and severe nocturnal 
anxiety attacks in adults (the classical 
nightmares”) which tend to take place 
either during NREMS or concurrent 


* Sleepwalking incidents lasting mor? than 

0 seconds have been shown to pa 
companied by a change in the EEG tracing 
to low voltage faster frequencies (Kales, 
acobsen, Paulson, Kales, & 
Our point of view suggests that such 
incidents could be partial “arousal A 
Sponses occurring in reaction ip sue 
NREMS. This however does not imply tha 
NREMS is an activated state, but rather 
that it may be interrupted, under certam 
conditions, by episodes of partial activation. 
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with an awakening from this sleep 
phase (Gastaut & Broughton, 1965). 

Some of these reactions could also 
be interpreted as results of release 
from cortical inhibition during the 
deeper stages of sleep. Two tentative 
explanations are suggested for these 
disturbances: (a) Deafferentation in 
NREMS may lower thresholds for 
stress reactions in chronically anx- 
ious subjects; and (b) cortical de- 
pression in NREMS may lead to corti- 
cal release phenomena in certain pre- 
disposed Ss. These possibilities are 
not mutually exclusive. Possibly there 
is an interaction between stress and re- 
lease factors in the sleep disturbances. 

In addition to pathological behavior 
in Ss inclined towards episodic phe- 
nomena in sleep, even entirely normal 
Ss seem especially vulnerable to bi- 
zarre reactions if awakened abruptly 
from NREMS. Thus Gastaut and 
Broughton have been able to provoke 
varying degrees and durations of auto- 
matic behavior (verbal, gestural, or 
even ambulatory, i.e., an induced sleep- 
walking attack); relative behavioral 
nonreactivity to exteroceptive stimuli ; 
retrograde amnesia for intercurrent 
events and other "symptoms" of sleep 
disturbances in entirely normal per- 
sons, merely by the expedient of abrupt 
forced awakenings from Stage 3 or 
4 sleep (Broughton, 1966). It may 
be pertinent to recall Dement's com- 
ment on NREMS, to the effect that 
“there is some small hint that in cer- 
tain rare instances, the ultimate, in- 
Je terrors may arise out of 
this time of possible vulnerability." * 
Dement cites one adult $ who on two 
or three occasions abruptly terminated 
a period of NREM sleep in his labora - 
tory by sitting bolt upright and emitting 

“ploodeurdling scream” although 


specifically to the pos- 
“repression barrier” in 


expressib 


5 Dement referred 
sible lowering of the 
deep NREMS. 
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there was no dream recall or even re- 
call of screaming (Dement, 1964). 


Tne REM PHASE 
Cortical Activation during REMS 


To summarize thus far: We have 
presented evidence suggesting that 
NREMS is characterized by functional 
deafferentation ; lowered cortical tonus ; 
reduced vigilance; general disorganiza- 
tion of associative processes (loss of 
"phase sequence") and, in certain Ss, 
episodic phenomena which might be 
interpreted as indicating stress. We 
will now turn to REMS. 

In contrast to NREMS, most areas 
of the neocortex are as highly active 
during REMS as they are in the most 
alerted waking state (Dement, 1964, 
1965a; Roffwarg, Muzio, & Dement, 
1966; Snyder, 1965b). During the 
rapid eye movement phase there is a 
fast, irregular, low voltage (desynchro- 
nized) EEG pattern resembling an 
awake pattern except for relative ab- 
sence of alpha activity. At this time 
cortical blood flow increases (Kanzow, 
Krause, & Kühnel, 1962), and hippo- 
campal theta rhythms (Shimazono, 
Horie, Yanagisawa, Hori, Chikazawa, 
& Shozuka, 1960), pyramidal tract 
activity, (Arduini, Berlucchi, & Strata, 
1963), cortical direct current poten- 
tials (Wurtz, 1965), cortical induced 
movement thresholds (Hodes & Su. 
zuki, 1965), and auditory evoked re- 
sponses in animals (Herz, 1965), to- 
gether with numerous other measures, 
show marked changes which bear a 
striking resemblance to the most at- 
tentive waking state. 


9 While the above findings suggest that 
cortical tonus is enhanced during REMS, 
evidence is less clear-cut with respect to 
vigilance in this state. During REMS a rise 
in arousal thresholds is reported in cats 
(Jouvet & ]Jouvet, 1963) while „lowered 
arousal thresholds are reported in adult 
humans (Rechtschaffen et al., 1966). 


HARMON S. EPHRON AND PATRICIA CARRINGTON 


This neuronal storm is not accom- 
panied by corresponding behavioral 
activation, however. Except for spo- 
radic jerks or twitches, S remains limp 
during REMS. Nevertheless, cats 
with bilateral destruction of the locus 
coeruleus (a noradrenergic nucleus in 
the pons which appears to account for 
the muscular atonia of REMS) behave 
differently. During REMS these cats, 
while totally unresponsive to actual 
visual stimuli, are observed to stare 
fearfully at invisible objects, to stand 
on their feet and exhibit ragelike be- 
havior, and even to “stalk” imaginary 
moving objects, in marked contrast to 
their quiet state during NREMS 
(Jouvet, 1965b). Thus, with specific 
motor-inhibitory centers ^ removed, 
REMS gives indications of being an 
intensely activated state both behavior- 
ally and neurophysiologically. . 

Another impressive characteristic 
of REMS is the massive activation in 
the sensory receiving areas of the 
brain which occurs in the relative ab- 
sence of any external sensory input 
(Bizzi & Brooks, 1963; Brooks & 
Bizzi, 1963). Evarts (1962) reports 
that single neurons in the primary vis- 
ual cortex of the cat typically have à 
70% higher rate of discharge during 
REMS than during NREMS, and 
Bizzi (1965) has shown that the spon- 
taneous discharge of large numbers 
of lateral geniculate neurons is modi- 
fied during REMS, resulting in an 
increase in rate of firing in the ma- 
jority of these cells, Even the modu- 
lations of tone in the fine muscles of 
the middle ear show phasic and tonic 
changes during REMS, just as they do 
in animals responding attentively to 
waking events (Baust, Berlucchi, & 
Moruzzi, 1964 ; Dewson, Dement, & 
Simmons, 1965). 

Current data thus suggest that 
REMS may approximate a state of 
endogenous afferentation, and Roff- 


RE 
REM Steer AND CorticaL HOMEOSTASIS 


war. 
lined ae Fera have recently out- 
lerentatior chanism by which such aí- 
thors die aa, take place. These au- 
bursts of findings which suggest that 
"REM gl impulses emanating from a 
to the nue d center" in the pons ascend 
along E amus whence they course 
Projection. primary thalamo-cortical 
tally-indy pathways (as environmen- 
receiving = impulses do) to sensory 
Ment (1964) 1 in the cortex; and De- 
this "int ) has suggested that once 
stituted! a sensory input” is “sub- 
chain,” inn the “stimulus-response 
Teact to eier centers interpret and 
Dercepts ; as if it were a set of true 
System f impinging on the nervous 
t REM without. 
ogenous IS represents a state of en- 
*rentiate WEE input, we might dif- 
P erentation. states with respect to 
WW : 
°Benous aber (predominantly ex- 
ise Ep REM during which 
ag Pets, eni regularly receives, in- 
Auli, responds to external 
.*. NRE 
xm) ^ie ad (functional deafferenta- 
ie e pins Ww hich, the organism may 
Tpret an pew stimuli, but will in- 
lg pu ee ee only in 
aS REM fashion, if at all. 
Nous affere (predominantly 
; ntation) during W 


Or 
Sani 
e sm i 

Xte is largely unrespons 


endog- 
hich the 
ive to 
same 


ti Sins b 

xe a d timuli, while at the 
mulation ee increase in internal 

f iy: OP occurs, with the presence 


gest- 


me, . Rt this i i 
e his input is patterned an 


"Ingfu] 
How 
S vie 
in the. of the polarity o 
eh, we ^ue with respect to 
at ave considered the 


f the two 
afferenta- 
possibility 
unctional 


e T 

inte Po may have the f 3 
ja mara teraeting deafferentation 
See S. More broadly, it might 
rtex fol- 


des s 
reinvigorating the cO 


511 


lowing depletion i i 
E A oe i of cortical functions 
The dreaming process, an integral 
part of human REMS and one pein s 
common to many lower animals as € 
(Dement, 1965a; Snyder, 1965a) may 
be an important component of this s 
dogenous afferentation. In order to 
examine this possibility we will con- 
sider evidence suggesting that REM 
dreaming, as an hallucinatory process 
rovides the sleeping brain with pat 
tierned sensory input. 
Hallucinations as Replacement Phe- 
nomena 
Hallucinati 
or auditory) 


ons (usually either visual 
have been reported to oc- 
cur in certain persons under conditions 
of experimental sensory deprivatior. 
(Freedman, Grunebaum, & Greenblatt 
1961; Vernon, McGill, Gulick, & 
Candland, 1961) and apparently it is 
not unusual for persons subjected to 
reduced sensory input to greet these 
isions as à form of welcome relief 
from monotony, In similar manner, 
i rs, and others in 

isolation are fre- 
quently reported to welcome their hal- 
lucinations ( Miller, 1962). It would 
seem that with diminished stimulation, 
stimulation of any kind may be avidly 
sought. Lilly (1956) for example de: 
scribes à «stimulation-action AMET: 
intense in 95 undergoing sensory ia 
rivation that stroking one finger with 
another becomes an ineffable pleasure. 
We might won er whether those hal- 
-periences which are 


lucinatory Pef 
greeted wih relie 


eftec 
replacement f 
oP deprived of sensory 


a 

nism enso! 
pu egarded as creating its 
which it subsequently 


hallucinatory 


ht 
to 


attends: 
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experiences in patients deprived of 
sensory input through illness or in- 
jury tends to support the notion that 
replacement of absent sensory input 
may be a function of hallucinations. 
Bartlett (1951), in his description of 
an old man with cataract, showed that 
the hallucinatory activity occurred only 
in that limited portion of the visual 
field which corresponded to the posi- 
tion of the cataract; while observations 
of  phantom-limb experiences are 
equally specific in showing strictly 
limited hallucinosis suggestive of re- 
placement phenomena (Bartlett, 1951; 
Fisher, 1962). Similarly, auditory hal- 
lucinations may occur in individuals 
having a progressive loss of hearing, 
with musical perceptions frequently 
noted in such patients (Rosanski & 
Rosen, 1952), 

Still further, schizophrenia, a con- 
dition often accompanied by hallucina- 
tions, has recently been compared to 
& state of sensory deprivation (Rosen- 
sweig, 1959; Searles, 1963). As 
Searles indicates, schizophrenic pa- 
tients may become so isolated from 
their environment that they scarcely 
perceive persons or objects in the natu- 
ral world. A view of schizophrenia 
as a form of sensory deprivation sug- 
gests that the hallucinations of this ill- 
ness might be seen, at least in part, as 
results of the sensory-deprived state of 
the patient. If so, the hallucinations 
might represent a pathologic attempt 
to counteract the deprivation through 
creating an illusory world of the 
senses. 

In similar manner, we might view 
the experience of dreaming as a form 
of recovery following functional deaf- 
ferentation in NREM sleep. Just as 
the REM state is characterized neuro- 
physiologically by activation of the 
sensory receiving areas of the brain; on 
the psychological level, the dream proc- 
ess (at least in man) appears to repre- 
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sent a form of intricately patterned 
perception. An elaborate structuring 
of the spatio-temporal field takes place 
during dreaming, and there is evidence 
that the length of the dream is roughly 
proportional to the hallucinated dream 
events occurring within it (Dement & 
Wolpert, 1958). Furthermore, sub- 
jective evidence indicates that early 
sequences which form part of a long 
dream are often recalled by the dreamer 
during a later sequence of the same 
dream; just as in waking life we typi- 
cally remember events leading up to the 
present. Such references to the “past 
of the dream, experienced during the 
dream, suggest that a complex memory 
process operates during REMS, and, 
as we have seen, there is also some ob- 
jective evidence of ability to commit 
events to memory at this time (Man- 
dell et al., 1965). Similarly, juxta- 
position of dream objects in space, and 
perception of motion in the dream, sug- 
gest a sophisticated (though still some- 
what distorted) orientation in space 
during REMS. Evidence thus points 
to an extensive recovery of integrative 
functions during the rapid eye move- 
ment dream, seemingly in contrast to 
the disintegrative trends noted in 
NREMS * (see above). 


IMPLICATIONS OF A HOMEOSTATIC 
Hyvoruesis 


The Sequence NREMS/REMS 


If, as suggested, REMS is triggered 
in reaction to loss of cortical tonus in 
NREMS, then it should ordinarily fol- 
low NREMS, This appears to be the 


“It would seem that NREM dreams may 
fulfill a sporadic afferenting function, similar 
to that of the REM dream, although they are 
milder and not accompanied by the physio- 
logical changes of the REM state. Perhaps 
the intermittent sensory “input” which they 
afford contributes toward the maintenance of 
a minimal level of cortical tonus during 
NREMS which is not sufficiently pronounced 
to register by electrophysiological means. 
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case. Normal mature human Ss, 
for example, do not show REMS un- 
til roughly 60 to 90 minutes have 
passed after sleep onset, and an initial 
Stage-4-Stage-I-REM contingent re- 
lationship has been postulated by Webb 
and Agnew (1965, 1966) based on 
these investigators! normative studies 
of sleep patterns in young adults. 
Similarly, a period of NREMS has 
been shown to precede REMS in ma- 
ture animals? (Levitt, 1965; Roldan, 
Weiss, & Fifkova, 1963; Sawyer, 
1965) except where the sleep cycle 
has been radically altered by manipu- 
lation as in REMS-deprivation. 

NREMS also appears to represent 
an indispensable condition for the oc- 
currence of artificially stimulated 
REMS phenomena. Jouvet and Michel 
(1960a, 1960b) have demonstrated, 
for example, that in the intact cat it 
is possible to induce REMS by stimu- 
lating certain points in the midbrain or 
Dons provided this is done during 
NREMS. Furthermore, having once 
Stimulated REMS in this manner, it 
Was impossible for these investigators 
to repeat the effect for 15-30 minutes. 

his refractory interval suggested E 
them that REMS is due to the libera- 
tion of some neurohumoral agent fo 
Whose production and accumulation 
NREMS, or something associated with 
it, provides the stimulus. 

Sleep experimentally produced by 
Hernández-Peón (1964) through m 


Pim means, and by ead Iso has 
di : ation, 4 n 
rough electrical stimu MS, NREMS 


hot led directly to REM», ^ 
always occurring first and giving ME 
to a subsequent REM phase. Also 0 
interest, a study by Sovrad and Kama- 


Tova (1965) has shown that ine dl 
jected with bulbocapnine DOM sop” 
resses the amplitude of the high-vo 

8 Weitzman, Kripke, Pollak, & Dorn 
(1965) report occasi onse 
mn "preadolescent" mon! 
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age slow waves of NREMS) show no 
episodes of REMS until the amplitude 
of the high-voltage sleep waves has 
returned to base-line level (i.e., until 
sufficient “depth” or “intensity” of 
NREM sleep has been restored). 
NREMS thus appears to be a pre- 
requisite for REMS unless certain 
types of unusual experimental manipu- 
lation or pathological states interfere 
with the normal cycle. 

Narcoleptics tend to commence 
sleep with REMS, and Rechtschaffen, 
Wolpert, Dement, Mitchell, and Fisher 
(1963), have suggested that this illness 
represents a specific disturbance of the 
pontine system leading to pathological 
over-activity or premature triggering 
of the REM sleep center in the pons. 
A prominent EEG feature in narcolepsy 
is the frequent appearance of drowsi- 
ness and sleep patterns during the day, 
even when S appears to be wide awake 
and alert (Hishikawa & Kaneko, 
1965); and Hess (1965) points out 
that “the narcoleptic’s brain may be 
asleep while he is walking, working, 
etc.” Possibly, therefore, we might be 
justified in viewing the narcoleptic as 
experiencing an abnormal degree of 
functional deafferentation during wak- 
While the explanation for 
EMS in narcoleptics ap- 
he realm of neurophysi- 
ological pathology, conceivably a con- 
tributing factor might be an urgent 
need for endogenous afferentation cre- 


ated by the persistent. symptoms of 


drowsiness seen in these patients. Ac- 


cording to the hypothesis here pre- 
sented, a mounting need for afferenta- 
tion during these states of lowered vigi- 
lance might contribute to the likeli- 
hood that rapid eye movement sleep 
would be experienced prematurely. 
Neonates also typically oy = 
E ate at the beginning of sleep 
ae Dement, & Fisher, 1964 ; 
Roftwarg et al., 1966), and according 


ing life. 
sleep onset R 
pears to lie in t 
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to Roffwarg? the occurrence of sleep- 
onset REMS in infants seems to reflect 
a generally greater "need" for REM 
sleep in the infant organism which 
this author has related to the require- 
ments of the maturing brain for endog- 
enous sensory stimulation, since at that 
period there is relatively little oppor- 
tunity for exogenous input. 

Some findings by Jouvet and his 
group (Jouvet, 1965a) also have bear- 
ing on the NREMS/REMS sequence, 
These investigators have demonstrated 
that decorticate (pontile) cats regu- 
larly show periods of muscular atonia 
together with phasic activity (includ- 
ing rapid eye movements) similar to 
that seen during REMS in intact cats, 
while these pontile animals show no 
NREMS. Despite the absence of cor- 
tex, the REMS system appears to re- 
main in operation as long as its pontine 
triggering mechanism remains intact. 
While these investigators seem clearly 
to have demonstrated an independence 
of REMS from NREMS in their 
decorticate preparations, we need not 
assume that such an independence ob- 
tains in unoperated animals. As pre- 
viously suggested, it may be that where 
there is a cortex, the occurrence of 
REMS serves to restore cortical func- 
tions following lowered cerebral vigi- 
lance in NREMS. It may be pertinent 
in this connection to recall that ani- 
mals that naturally have little neo- 
cortical tissue, such as certain reptiles, 
do not show any clearly identifiable 
REM sleep. 


An Accumulable Tendency toward 
REMS 


Influences which tend to deepen 
sleep, such as some drugs (Feinberg, 
Koresko, Gottlieb, & Wender, 1964. 
Hendley, 1963) and sleep deprivation 
(Berger & Oswald, 1962), may have 


9H. P. Roffwarg, personal communica- 
tion, July 27, 1965. 
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the immediate effect of decreasing the 
amount of REMS (although typically 
REMS is later elevated above base- 
line levels in apparent compensation for 
the earlier suppression). Thus REMS 
is sometimes lessened when cortical 
tonus reaches exceptionally low levels, 
suggesting temporary postponement of 
the hypothesized homeostatic regula- 
tion. This may occur because under 
certain conditions (such as excessive 
fatigue) NREMS takes precedence 
over REMS, antagonizing the physio- 
logical mechanisms responsible for " 
(Roffwarg et al., 1964). Since al 
homeostatic mechanisms can be 
thwarted by some circumstances, per- 
haps it might be pertinent to eae 
guish between the effects of norma 
deep sleep on the one hand, and the 
possible effects of deep sleep due to an 
excess of noxious factors (extreme 
fatigue, drugs, etc.) on the other, 
since it is the latter conditions which 
appear capable of upsetting the norma 
REMS cycle, 

Evidence suggests, however, st 
Postponement of the hypothesize 
homeostatic responses may also occu" 
under conditions which can be con- 
sidered commonplace, as, for example 
When a waking state intervenes: 
Webb1° reports on Ss brought into 
the laboratory 2 hours after a, fi 
night’s sleep (9 A.M.) and permitte 
to sleep for 2 hours. These Ss dis- 
played remarkably elevated amounts © 
REM sleep during these morning naps 
(around 40%), similar to the concen- 
tration of REMS seen during ae 
morning hours of uninterrupted nigh 
sleep, while they showed almost 70 
Stage 3-4 sleep, often bursting int? 
REMS immediately, 1 

Triggering of REMS by directly 
preceding NREMS could not have = 
curred in this case. However, W 


19 W, p. Webb, personal communicatio? 
Tune, 1966. 
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Bests that 2 the cortex, evidence sug- 
ation is bi. a deal of waking stimu- 
Clency js at before cerebral ef- 
B oma: ‘ed restored after sleep. 
seems to m a person's performance 
he morni € worse after getting up in 
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A UE to bed. There fol- 
m improvement as the 
n after g e was lengthened, but 
led hours of sleep, the per- 
: as not up to the presleep 
the Siow inn therefore, 
seo Cither ov I deafferenting process 
tion” or con ershoot its mark during 
te throy pe that reafferenta- 
Unterac es REMS does not fully 
© end of loss of cortical. tonus by 
all residual cot 
a erentatio effects from the night's 
tea a cep o which persist into the 
py tentation ja hours are postulated, 
awape OBical may benan important 
diyi doang ih Fi me following 
this ual wh he morning, with an in- 
hypog © returns to sleep before 
hetical deficit is met 9? 


tis; 
int x 
yt €d the ge that Omwake's subjects 
e loss of cerebral efficiency 

st 2 hours of sleep where 


usu. 
;"Sually predominates, with a Pro" 


im 
Drovement in task performance 
ight where 


Most Cren; the portions of the nig 
g tation" E 
kep Lon" through REMS might be 


i 
e 
to occur. 


L P 
thalamo-cortica 


515 


countering a cortical de 

on : mand for inten- 
siv a sensory stimulation in REMS 

emporary postponement 
T of 

pen response as described ds 
clearly implies an accumulable tendenc 
to produce REMS. Because of its 


importance for an understanding of the 


proposed homeostatic rinci 
possibility will now ae i 
According to the “homeostatic” 
theory, the neocortex regularly calls 
upon centers in the pons to supply it 
with necessary stimulation durin 
sleep. We might conceive of a d 
tionally deafferented cortex sendin 
messages to a REM sleep center in vi 
pons demanding endogenous afferenta- 
tion, which is then effected by means 
of impulses sent coursing along ponto- 
] circuits to the sensory 
areas of the brain. Such a 
ld utilize the equivalent of 
e organs (as Davis, 
1950, has suggested) capable of re- 
sponding to the level of cortical tonus 
even during deepest NREMS; or it 
might operate in some other as yet un- 
suspected fashion to monitor cerebral 


excitation. 
Tt seems reasor 


receiving 
process cou 
intracortical sens 


nable to suppose, how- 
ever, that a homeostatic mechanism of 
such presumed importance for the 


roper functioning of the brain would 
upon immediate feedback 


tinuing operation 
logical timing 
resistant to 


mechanism essentially — 
change. Jouvet and Michel (1960a, 
(1965b) have of- 


d Dement 
ich suggested the ac- 


e biochemical sub- 


h triggers ^v 
the notion of a bio- 


triggering mechanism may 
or the proposed homeostatic 
biochemical triggering 


no means the only ex- 


chemica 
have value 


hypothests: 
substance i5 D 
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planation for an accumulable tendency 
to have REMS, however. As Hart- 
mann (1965) indicates, a combination 
of substances, the lack of a substance, 
or altered sensitivities of certain cell 
groups might explain the facts as well. 
Whatever the exact nature of such a 
process, eventual restoration of cortical 
tonus in the face of temporary post- 
ponement implies a mechanism capable 
of registering the effects of loss of 
tonus and retaining this information 
over a period of time. One possibility 
that might be considered is that lower- 
ing of vigilance and consequent deaf- 
ferentation in NREMS automatically 
permits accumulation of a REMS- 
triggering substance, with eventual on- 
set of REMS ensured once this sub- 
stance reaches a preset level of concen- ` 
tration. While a REMS “potential” 
would thus be built up during deaf- 
ferented states, onset of REMS could 
be deferred under conditions not fay- 
oring immediate endogenous afferenta- 
tion. Such might be the case, for 
example, at the beginning of the night 
when the need for slow-wave sleep 
may preempt REMS, when a waking 
state intervenes, or when REMS is 
artificially prevented by experimenta] 
manipulation. 
A mechanism whereby lower 
lance causes accumulation of a REMS 
triggering substance could have the 
advantage of sensitively balancing the 
degree of eventual cortical recovery 
with the degree of previous Jo 


ed vigi- 


; SS of 
cortical tonus, while at the same time 
permitting delay in timing of the 
homeostatic response. Conceivably 


such a process might have been operat. 
ing in Webb’s napping | Ss. A con. 
centration of deafferenting sleep im. 
mediately prior to the onset of REMS 
would not haye been required to trigger 
the REMS process according to such 
a hypothesis, since the triggering sub. 
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stance could have accumulated during 
the previous night's sleep. 


Cortical Homeostasis in Human Sleep 


Since most homeostatic influences de- 
pend upon numerous values that deli- 
cately balance each other, it becomes 
an awesome task to attempt to gauge 
such a process. Any theoretical model 
must fail to do justice to the interplay 
of factors to which the cortical homeo- 
Static process presumably responds. 
Since not all these factors can be 
handled at any one time, certain regu- 
latory tendencies will presumably be 
Superseded by other tendencies which 
bring about more urgent adjustments 

deficit state requiring homeostatic 
regulation should generally persist, 

owever, even though the corey 
Teaction to it is temporarily thwartec 

Evidence Suggests that a simple aC 
ion-reaction model does not describe 
the manner in which cerebral homeo- 
Stasis operates during a night of pro" 
longed sleep such as is seen in the 
adult human, Instead, what we may 

€ seeing is a competition between an 
flicting homeostatic responses, While 
certain correlations are reported be 
tween the length of NREM periods 
and the length of their subsequel 
periods in man (Hartman 
1965 ; Rechtschaften & Verdont, 1964), 
there are no similar indications of a°Y 
egg relationship between the a 
gree of loss of cortical tonus 17 . 
NREM Phase and either the degree e 
cortical activation evidenced in its p 
following REM phase or i 
t latter. For examp 
the initia] NREM phase which follow 
Sleep onset in man might be describe. 
“deepest” sleep of the night 
cause of the preponderance of Stagê , 
at this time, yet this sleep phase 
followed by the shortest and often e 
Physiologically least activated of th 


the 
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night’s REM periods. To understand 
this occurrence, it may be useful to 
view the alternation of the sleep phases 
in the broader context of the sleep- 
waking cycle. 

As Davis (1950) and Gellhorn 
(1953) indicate, cortical homeostasis 
Presumably operates to reduce cerebral 
Overexcitation as well as underexcita- 
tion. If we were to consider the awake 
State as leading toward eventual over- 
Stimulation, and the sleeping state as 
leading toward eventual understimula- 
tion, we might speculate that not only 
Must cortical efficiency be maintained 
hy oe functionally deafferented states 
Y à process of endogenous afferenta- 
tion, but conversely, that following 
States of overafferentation functional 
deafferentation becomes necessary. ^ 

Conceivably, after prolonged sensory 
»ombardment, occlusion of sensory In- 
ih 1S required to reduce cortical stim- 

ation to levels more acceptable to the 
Organism, Tf so, a homeostatic reac- 
a to overafferentation | might be a 
e fhificant factor in the initial phases 
j pormal sleep. 
we We view 
a REM phase as servir 
we effects of prolonge 
pe, ght not expect t 
afi lod to afford extensive endogenoa 
., Tentation (as evidenced by its € 
Seer. vation or duration). Tt Mid 
in ™ rather that following agumi 

nsive sensory bombardment rom 
P day, overafferented neural ho 
ey might resist further stimulation 
i REMS or by waking) a c» 
P, as sufficient recovery © bly 
nectures has taken place. E: od 
i eee the more vigorous ma j* E 
pl ns of REMS might be called king 
a only after the effects of wa oi 

ulation have been fully coun 


ted. Thi likely to be 
the... This would seem tions : 
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uninterrupted daytime wakefulness 
might therefore be thought of as placing 
a special burden on cortical homeostatic 
regulation. 

This possibility appears to receive 
some support from the fact that the 
distinct difference between the early 
and late portions of the night is first 
seen at that period in the child’s de- 
velopment when daytime napping has 
finally terminated and an overall di- 
urnal rhythm of sleep and wakefulness 
becomes established. (Roffwarg, De- 
ment, & Fisher, 1964). Also possibly 
relevant, Maron, Rechtschaffen, and 
Wolpert (1964) report that nocturnal 
sleepers who napped in the afternoon 
had shorter REM latencies than sleep- 

“ers who napped in the evening. Roff- 
warg, Muzio, and Dement (1966) in- 
terpret these various findings by sug- 
gesting that "fatigue" may develop un- 
der circumstances of prolonged arousal 
which tips the normal balance between 
REMS and NREMS, augmenting 
temporarily the “need” for deep 
NREM sleep and simultaneously an- 
tagonizing REM sleep processes. 

‘One might further conceptualize that 

's advanced wakefulness capacity 
of 15-17 hours results in mild, habitual 
deprivation of sleep which might other- 
wise occur spontaneously during the 
day. Such jntermittent sleep is fre- 
quent in infants and pu ove that 
are not trained to resist the sleep 1m- 
pulse. In these organisms, functional 
deafferentation may occur py or ps 
whenever it is required. If so, they 
should not app 


roach sleep with the 
same “need” for functi 


onal deafferenta- 
tion as adult man and we do find them 
E + relativel 


experiencing y short NREM- 


man 


les. è ft 
i equently unstable initial 

£ breh ies occur in man, how- 
REM Prig his first hours of sleep. 
en n cture that the pacing of 
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cally old and biologically primitive 
mechanism which regularly triggers 
this state in response to lowering of 
cerebral vigilance in NREMS, perhaps 
thereby serving the adaptive function 
of periodically alerting the organism 
during sleep for “sentinel” purposes as 
suggested by Snyder (1966). Mech- 
anisms initiating activated sleep may 
thus be automatically called into play 
by the first NREM phase even though 
the central nervous system may not 
yet be sufficiently unburdened at this 
time to tolerate extensive endogenous 
afferentation, This would be likely to 
result in a relatively short first REM 
period. 

A process similar to that outlined 
above might be expected to lead to a 
complex sequence of actions and re- 
actions between sleep phases. While 
not necessarily demonstrating precise 
compensatory relationships between ad- 
jacent phases, such a homeostatic proc- 
ess may nevertheless result in gradual 
approximation, over the night, to a 
“center-of-gravity” Position with re. 
spect to the degree of cortical tonus 
maintained during Sleep and subse. 
quently the degree of cerebral efficiency 
available to the sleeper upon awaken- 
ing in the morning, A Possibility sug- 
gested by Omwake’s ( 1932) experi. 
ment is that this equilibrium may not, 
in fact, be fully achieved until severa] 
hours of morning activity have sup- 
plied further stimulation. 

The model described above would 
seem in accord with the homeostatic 
process as it operates in other areas 
and as it is paralleled in servo-mecha- 
nisms, where we may see the counter- 
activity which corrects the undesirable 
deviation proceeding in an oscillating, 
irregular fashion, sometimes overdoing 
and sometimes underdoing the job it is 
called upon to perform, until some 
measure of equilibrium is achieved, 
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Functional Deafferentation in Stage 2 


We have postulated two processes 
or tendencies during sleep, the one 
leading toward organismic rest and de- 
afferentation, and the other toward 
activation. and afferentation, While 
these tendencies have seemed linked 
to certain sleep stages, this linkage 
may not be absolute. It is conceivable, 
for example, that a general tendency 
toward afferentation (or deafferenta- 
tion) might Persist despite temporary 
elimination of the specific EEG sleep 
stage with which it is normally associ- 
ated. Some evidence which bears on 
this question is reported by Fisher, 
Gross, and Byrne (1966) and Karacan 
and Goodenough (1966) who indicate 
the persistence in NREMS of a cycle 
of penile erection in human males de- 
spite REMS deprivation. Tf the REM- 
period erection were to be viewed as a 
form of organismic activation indica- 
tive of the energizing of a major sen- 
Sory receptor area, these findings sug- 
gest that with REMS eliminated, 
NREMS may assume some of the en- 
livening or “afferenting” functions or- 
dinarily linked with the REM state. 

Reasoning along the same lines, we 
might wonder if, under certain cir- 
cumstances, Stage 2 could assume 
functions such as Profound deafferen- 
tation, which are usually associated 
more directly with Stages 3 and 4. As 
Previously indicated, Ss in slow-wave 
3-4) tend to respond 
only minimally to exteroceptive stimuli 
with this functional deafferentation 
maximalized in Stage 4, While it 
might be expected, therefore, that 
these deep NREM sleep stages would 
be likely to bring about an organismic 
"need" for REMS, evidence suggests 
that Stage 2, combined with only min- 
imal amounts of Stage 3, may also 
Perform such a function. 


In a recent experiment Webb (see 
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Footnote 10) deprived Ss for 7 nights 
of most of Stage 3 and all of Stage 4 
sleep by giving a warning tone when- 
ever slow waves appeared on the EEG, 
followed by an electric shock to the 
foot if Stage 3 continued. The re- 
sulting records showed a greatly in- 
creased Stage 2 percentage with REM 
percentage not reduced significantly 
below base-line levels. These results 
might be interpreted as indicating that 
Stage 2 sleep alone is sufficiently de- 
afferenting to require endogenous af- 
ferentation through REMS, an ex 
Planation which seems plausible in 
light of experimentation showing func- 
tional deafferentation in the visual sys- 
fem at sleep onset (Rechtschaffen & 
Foulkes, 1965). On the other hand, 
We cannot be certain that the pro- 
speed Stage 2 sleep seen during 
tage 3-4 deprivation is the same, 
qualitatively. as Stage 2 sleep expert- 
“need under normal conditions. If we 
ASsume that the shocks administere¢ to 
ae do not eliminate the physiological 
ban urements for Stages 3 and 4 poe 
sle Only the manifestations of pm 
<P stages (as evidenced on i 
i G), we might wonder whether te 
creased Stage 2 seen in this exper! 
tt may not have assumed em S 
!€ functions of the eliminated sleep 
Stages, 
That the NREM sleep obtained WaS 
e aViorally “deep” despite the pr 3 
Ominance of a Stage 2 EEG is T 
Rested by the fact that Ss tolerate 
ast number of electric shocks ie 
E Phase (sometimes as many 25. : 
Shocks per night, occas! 
“sities mih S ald be ti 
a Pon awakening they be: of hav- 
5, t any subjective impressi" ^ phis 
been shocked during "ick of con- 
Soliq : of course, be due to 12 P ces edi 
N “ation of memory 4 did not 
dug TS, but the fact that a 
ily awaken with these 54 
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heavily concentrations of shock appears 
to suggest that a considerable degree 
of deafferentation was effected during 
their NREM sleep despite the enforced 
lightening of the EEG. 


REMS and Convulsive Phenomena 


To turn to consideration of the mech- 
anisms by which REMS may effect 
“toning up” of the CNS; Gellhorn 
(1953) suggests that convulsions may 
serve to restore cortical functions un- 
der certain circumstances. Perhaps it 
is significant, therefore, that REMS 
is characterized by paroxysmal phe- 
nomena (twitchings, bursts of eye 
movements, phasic CNS activity) 
which in certain respects resemble 
seizure activity. These convulsive as- 

ects of REMS are particularly notice- 
able when REM sleep is first per- 
mitted following prolonged REMS de- 
privation. At such times animals may 
show twitching of the body, paws, an 
whiskers so intense that eis is 
ally present à picture of epileptic seiz- 
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SS [on ie sop E 
ts (1964) e te 
i at the neurona 
a yn which characterizes 
H MS shows certain similarities to 
RE xysmal discharge of epilepsy. 
mu P ining the relationship of 
de Zarcone, Gulevich, 
+ (1966) report that psychi- 
fter a series of electro- 

showe 
EMS for a number of 


tenuia o tendency to make up for 
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that sleep P Dement, 1966) 
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the convulsive experience may either 
have substituted in some fashion for 
rapid eye movement sleep, or else that 
the convulsions caused temporary dys- 
function of that which mediates REMS. 

Still other findings hint at a relation- 
ship between REMS and convulsive 
phenomena. Testing seizure thresh- 
olds, Cohen and Dement (1965) report 
a marked drop in electroconvulsive 
threshold in rats after a 6-day period 
of REMS deprivation; and Winters 
and Spooner (1965), noting that very 
short REMS tend to occur just prior 
to and after induced seizure bursts in 
narcotized cats, have raised the ques- 
tion of a possible relationship between 
the REM state and generalized non- 
convulsant epileptic states. 

Tn light of the foregoing, it is con. 
ceivable that the mechanism by which 
hypothesized cortical recovery in REMS 
takes place may in some way be re- 
lated to that which brings about corti- 
cal recoveries following certain seiz- 
ures. Tf so, the relationship between 
cortical homeostasis and the occurrence 
of seizures, proposed by Gellhorn, 
might also be applied to the REMS 
process. Reasoning in this manner 
electroconvulsive shock might be me 
as supplying such intense stimulation 
to the cortex that a prolonged refrac- 
tory period ensues during which inten- 
sive stimulation by REMS is less pos- 
sible, and perhaps also less required 
from a neurophysiological standpoint, 


Related Evidence 


What evidence is there to suggest 
that the extent of the hypothetical re- 
covery in REMS reflects the extent 
of deficit brought about by NREMS? 

Two experiments may be pertinent, 
although neither was designed with 
this question in mind. Hartmann 
(1965), studying the relationship þe- 
tween sleep phases, found a positive 
correlation between the length of 


Harmon S. EPHRON AND PATRICIA CARRINGTON 


NREM periods (in humans) and the 
length of their subsequent REM peri- 
ods, while no correlation was found 
between the lengths of REM periods 
and their subsequent NREM periods.” 
He thus concluded that a need for 
REMS builds up during NREMS so 
that, other things being equal, a long 
NREM period is followed by a longer 
REM period. Along similar lines, a 
study by Rechtschaffen and Verdone 
(1964) shows a positive correlation 
between the amount of NREMS time 
Prior to the onset of the first REM 
period of the night and the length of 
the first REM period. 

One possible approach to the study of 
the interrelationship of the sleep 
Dhases might be to vary the amount 
of exogenous stimulation introduced 
into the sleep cycle during NREMS. 
and observe the effect of REMS. Tf 
greater amounts of exogenous stimula- 
tion are made available (in brief awak- 
enings) and cortical tonus thus main- 
tained at an exceptionally high level 
during NREMS, we might expect that 
there would be less need for hypo- 
thetical Tecovery of cortical tonus 
through REMS, While this has not 

cen systematically studied, an experi- 
ment by Rechtschaffen and Verdone 
(1964) Provides some data which 
may be relevant. These investigators 
showed that the typical “first-night 
effect” in the sleep laboratory (when 
Ss show decrements in REMS-time 
aS compared with subsequent nights) 
Is related to the greater number O 
brief awakenings which also occur on 
the first night. Since such multiple 
awakenings seem 
erally heightened level of vigilance 
(probably due to the unfamiliar condi- 
tions of the experiment), one interpre- 


tation of these results might he that ? 
need for restorati 


to represent a gen- 


on of cortical func- 


eos Weitzman et al, (1965) report con- 
Tasting findings in monkeys, however. 
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tions through REMS was rendered 
less necessary by the already alerted 
state of Ss. : 
_ Also relevant to the present hypothe- 
Sis, a study by Roldan, Weiss, and 
Fifkova (1963) suggests a recurrent 
homeostatic interplay between NREMS 
and REMS. Studying the rat, these 
authors identified a well-defined cyclic 
Sequence of EEG patterns repeated 
regularly throughout sleep. This se- 
quence consisted of three consecutive 
Stages: NREMS, REMS, and a brief 
arousal reaction regularly occurring 
following REMS. By systematic regu- 
larly repeated electrical stimulation of 
the mesencephalic reticular formation, 
Or acoustic stimulation, it was possible 
to demonstrate that in the course of 
Spontaneous sleep, cyclic changes in 
excitability paralleled the cyclic EEG 
Changes, A gradual decrease in ex- 
Citability in the reticular activating 
System occurred during the NREM 
Dhase, dropping out altogether in the 
panne period and during REMS. 
c llowing the brief arousal period on 
ermination of REMS, the entire cycle 
recommenced with greatly increased 
excitability displayed during the first 
Stages of the new NREM phase. In 
‘eeping with the findings of Roldan et 
al, Snyder? reports a similar cycle 
of changes in pulse and respiratory 
Tates in the opossum which regularly 
Teach peak levels in the brief arousal 
Period following REMS, gradually fall- 
ing once more in the NREM period. 

One interpretation of the findings © 
Roldan and his associates might be 
that REMS serves to restore cortical 
functions in these animals to 2 degree 
Sufficient to produce the prief arousal 
and subsequent lowered arousal thresh- 
Olds of the post-REMS period. . 

To turn now to the relationship be- 
tween REMS and deafferented states, 


" *8 Frederick Snyder, personal com 
ation, Tune 10, 1965. 


muni- 


a means of studying this problem 
might be to investigate the effects of 
experimentally reduced sensory input 
on REMS. While there has been no 
research to date on the effects on 
REMS of sensory and/or perceptual 
deprivation, Wood has explored the 
effects of social isolation on sleep pat- 
terns (Wood, 1962). This investi- 
gator locked Ss in a room empty save 
for two chairs, a table, and reading 
matter, permitting them no contact 
with other persons for the duration of 
their waking hours. The percentage 
of time spent in REMS on the night 
following isolation was then measured 
in the sleep laboratory. Results showed 
that REMS time on the postisolation 
night was approximately 60% greater 
than the mean percentage REMS time 
for four base-line nights ($ = e OL), 
in all Ss studied; and subsequent stud- 
ies have shown this result to be un- 
related to compensation for “first-night 
effect.” #4 An increase in REMS fol- 
lowing social isolation (which in this 
experiment included incidental partial 
sensory deprivation ) is consistent with 
the hypothesis that REMS serves to 


enous sensory input in the 


supply endog! 
absence of adequate input from the 


environment. 


Implications for Research 

In closing, we would suggest that ex- 
ters working on the psycho- 
p devote attention 
to the recurrent interplay between 
the sleep phases. To date, relatively 
little work has been done on this aspect 
of sleep research, with of coarse cer- 
tain exceptions (Agnew, Webb, & Wil- 
liams, 1964; Roldan et al., 1963; and a 
few others), most investigators focus- 
ing their attention on studying the sleep 
phases as discrete phenomena. 

Tt would seem that research attempt- 


Wood, personal communication, 


perimen 
physiology of slee 
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ing to relate the phases to each other 
may be fruitful. For example, atten- 
tion to the alternate lowering and rais- 
ing of cortical tonus in NREMS and 
REMS, respectively could lead to a 
comparison of these sleep states on a 
number of variables related to cortical 
efficiency. Such investigations should 
bring about a more profound under- 
standing of these sleep forms than the 
study of either in isolation, particularly 
if attention is directed to the continuum 
represented by the NREMS-REMS 
cycle (as in the study by Roldan et al., 
In addition, attempts to meas- 
ure broader tendencies toward afferen- 
as these may 
fluctuate throughout sleep (frequently 
boundaries of 


certain phenomena, such as reporting 
NREMS, which at 
"out of phase" and diffi- 
Cult to assess with respect to their role 


E of REMS may help 
to clarify the relationship 
eye movement sleep 


ments for Sensory stimulation, 
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cial reinforcement history itself (eg., 
the frequency, intensity, and variability 
of past social rewards). . : 
Unfortunately, most investigators 
who have noted the importance of such 
parameters of the past history of social 
reinforcement see their effects. almost 
solely in terms of creating differences 
in initial response strength upon which 
contemporary "shaping" procedures 
must be imposed (see, e.g., Staats & 
Staats, 1963). Such learning ap- 
proaches typically fail to take into 
account the recent evidence on the 
social psychology of the psychological 
experiment gathered by investigators 
such as Riecken (1962), Orne (1962), 
and Rosenthal (1963), which indicates 
that situations such as those studied 
in the verbal conditioning paradigm 
represent *, , . an interpersonal event 
involving all the intricacies of mutual 
social control [Jones, 1964, p. 16].” 
For example, Riecken (1962) suggests 
that experimental outcomes are fre- 
quently more dependent on an implicit 
process of interpersonal bargaining be- 
tween the subject (S) and the experi- 
menter (E) than they are the result 
of formal task requirements ; that is, S 
may or may not be willing to exchange 
doing what he perceives E wants him 
to do for a favorable evaluation. 
The present conceptualization of the 
impact of the individual’s history of 
social reinforcement on his present re- 
ceptivity to social reinforcement sug- 
gests that a person’s past history pro- 
duces an internal norm or frame of 
reference which importantly influences 
the nature of the interaction between 
the subject and the reinforcing agent. 
Although any number of the major 
parameters of an individual s history of 
social reinforcement may be implicated 
in this process, for the purposes of the 
present exposition attention will be 
focused on the percentage or rate of 
past social reward as the major deter- 
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minant of what will be referred to as 
an individual’s Social Reinforcement 
Standard (SRS). 

Conceptually, the notion of such a 
standard bears a strong resemblance to 
Helson’s (1964) notion of an adapta- 
tion level (AL), and, indeed, Bevan 
(1963) has extended the notion of an 
AL to the area of nonsocial reinforce- 
ment. The crucial differences between 
the AL notion and the present concept 
of an SRS are as follows: (a) ALs 
have been restricted to the intensity 
dimension of stimulation; and (b) ALs 
have always been looked upon as neu- 
tral points or points of psychological 
indifference. The present notion of a 
social reinforcement standard, on the 
other hand, posits that the kind of in- 
ternal referent that is established on 
the basis of past social reinforcement 
schedules is more than a neutral sum- 
mary of one’s past social reinforcement 
experiences; it is rather a preferred 
region around which one seeks to se- 
cure future social reinforcement. Such 
standards may be viewed as techniques 
of interpersonal uncertainty reduction. 
They allow us to smoothly coordinate 
our actions and interactions even with 
unknown others, Harvey (1965, P. 
249) goes even further and suggests 
that a refutation of 


- one's standardized construal of a situa- 
tion would be equivalent to a threat to the 
self, would portend negation of one's as- 
sumed means-end relations, and, if severe 
enough, would render the world chaotic an 
unstructured, destroying in the process one s 
means of fate control and techniques of de- 
fining and coping with his environment. 


It may then be assumed that an in 
dividual's past history of social rein- 
forcement defines for him a base line 
against which the adequacy or appro- 
priateness of present social reinforcer 
inputs is judged. For the moment sj 
Will be further assumed that the leve 
at which a person's social reinforce- 
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ment thermostat is set will be isomor- 
phic with the objective features of his 
past history of social reward. For 
example, it might be assumed that for 
an individual whose past frequency of 
social reward for achievement-oriented 
behavior was at a rate of approximately 
66%, a region of reward ranging be- 
tween 60% and 70% defines what an 
appropriate rate of social reward will 
be. The width of this range is assumed 
to vary with a number of factors, such 
as the variability which characterizes 
the individual’s past history of social 
reward. For example, highly regular 
schedules may be assumed to produce 
rather narrow and rigid criteria as to 
what constitutes an acceptable rate of 
social reward. 

Following Goffman (1959), it is fur- 
ther assumed that an individual does 
not merely define what he considers an 
appropriate rate of social reward and 
Wait passively to sce if the social en- 
vironment will live up to these expec- 
tations. He communicates to others 
his notion of what an appropriate rate 
of social reward is for a given class 
of behaviors by means of his “self- 
Presentation" ; that is, by his selective 
display of certain attributes or be- 
haviors in an attempt to influence the 
rate, direction, or type of social reward 
that he is to receive. The individual's 
Use of various self-presentations may 
be seen as a defensive stratesy de- 
Signed to cope with potential, as well 
as actual, disparities which may occ: 
between his social reinforcement stand- 
ard and the contemporary rate of s0- 
cial reinforcement. Thus, the attri- 
butes selected for display art likely to 
be ones which will clearly influence the 
favorability of the impression 5 iS m. 
ing. That is, the individual is assumed 
to display those personal attr ibutes 
which are indicative of his ability to 
Satisfy the needs of 4 particular other. 
For example, let us assume that most 
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Ss enter a typical learning experiment 
with a social reward standard that calls 
for a moderately high rate of social 
reward for achievement-related be- 
haviors. Furthermore, it may be as- 
sumed that Es who are college instruc- 
tors are particularly likely to give the 
impression, whether they intend to or 
not, that they expect and value rapid 
learning (see Rosenthal, 1963, for a 
discussion of E's hypotheses as unin- 
tended determinants of experimental 
results). In such a situation, to the 
extent that S perceives that E wants 
to think of himself as a good instructor, 
a rapid rate of learning or conditioning 
may be interpreted as a self-presenta- 
tion strategy designed to please E, and 
thereby insure S his desired rate of 
social approval. 

Tt is assumed that a self-presentation 
is particularly likely to take the form 
of variations in task performance when 
the difficulty of the task set for S is 
somewhat below his skill or capacity, 
since it is under these conditions that 
the entire range of performance from 
excellent to poor is under S’s volun- 
tary control (see Riecken, 1962, for an 
elaboration of this point). Tt may also 
be noted that the work of Jones, Ger- 
gen, Gumpert, and Thibaut (1965) in- 
dicates that Ss are significantly more 
likely to display attributes. (e.g. traits 
and opinions) which E indicates he 
values when the standards for task 
evaluations are subjective rather than 
objective. 

Typically, labor: 
et al. (1965) the openness or 
f the context of performance 
evaluation was varied orthogonally. In the 
open or subjective-evaluation condition it 


was made clear to Ss that there were no 
specific criteria for correctness other than 
the supervisor's judgment, while in the closed 
or objective-evaluation condition Ss were in- 
formed that the supervisor would have a list 
of the correct answers in front of him as he 


evaluated their solutions. 


atory studies of so- 


3]n Jones 
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cial reinforcer effects require. Ss to 
make verbal or sensory-motor discrimi- 
nations which are rather simple. Fur- 
thermore, S is usually confronted with 
a task for which the criteria for whether 
he has made a right or wrong response 
are left rather vague. Under such con- 
ditions, it seems likely that at least part 
of the response variance can be ac- 
counted for by S’s use of the “arts of 
impression management” to insure a 
rate of social reinforcement which he 
deems appropriate, given his past his- 
tory of social reinforcement. 


DISCREPANCIES FROM THE REINFORCE- 
MENT BASE LINE 


Investing the notion of a social re- 
inforcement standard with so much mo- 
tivational relevance leads to the follow- 
ing hypothesis : Substantial disparities 
above, as well as below, the person’s 
SRS will produce a state of negative 
affective arousal with a concomitant 
activation of various self-presentations 
designed to eliminate such disparities, 
Whether small or moderate departures 
from this base line are also likely to 
produce negative affect, regardless of 
their direction, is a more complicated 
issue. The “butterfly curve” hypothe- 
sis of McClelland, Atkinson, Clark, and 
Lowell (1953) and the experimental 
investigations of Haber (1958) in the 
realm of aesthetics, as well as simple 
sensory experiences, suggest that small 
discrepancies from a base line in either 
direction produce positive affect, while 
larger discrepancies produce negative 
affect. Further, stimulus inputs at the 
base-line level are assumed to produce 
the negative affect of boredom, 

Two kinds of factors weigh against 
our generalizing these notions to the 
functioning of a social reinforcement 
base line. First, it hardly seems plausi- 
ble that when the stimulus series in- 
volves social rewards, information 
which confirms the accuracy of our 


construal of a social situation will re- 
sult in the negative affect of boredom. 
Second, the butterfly-curve hypothe- 
sis implies that it is only the magnitude 
of discrepancy that is crucial, not the 
direction. Contrary to this assumption, 
Harvey and Clapp (1965) have experi- 
mentally demonstrated the importance 
of taking directionality into account, at 
least when the stimulus series has im- 
plications for the self-esteem of S. 
Their work indicates that even small 
negative discrepancies from a person s 
self-construal may produce negative af- 
fect, while positive discrepancies of 
even moderate size produce positive 
affect. Since social reinforcement in- 
put obviously has relevance to the self- 
esteem of S, it seems necessary to take 
into account the directionality of the 
discrepancy, at least when dealing with 
small- or moderate-size disparities. 
However, as we will attempt to demon- 
Strate below, it will be assumed that 
large disparities from a social reinforce- 
ment base line, regardless of their di- 
rection, produce negative affect. 


AN EXCURSION INTO THE PsvcnuoLoGY 
OF OVERREWARD 


The notion that, where the disparity 
Írom the social reward standard b 
large, even positive discrepancies will 
produce negative affect, and, hence, ac- 
tivate a self-presentation strategy de- 
signed to lower the level of social ap- 
proval and/or increase the level of 
social disapproval, seems both counter- 
intuitive and contrary to the positio? 
that most “game-theoretic? or “eX- 
change” oriented investigators are 
likely to take: Therefore, we will de- 


* For example, Thibaut and Kelley (1959); 
in elaborating their "exchange" model 9 
Social interaction, assume that the ennen 
level of social reward does not become n 
preferred or optimal level of reward—t ch 
individual always seeks a relationship on 
is better than the average relationship. T 
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ed considerable pains at this point to 
Tess this derivation. 
Deu ens for such a derivation may 
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(1960) finding that persons of low 
self-esteem are less able than persons of 
high self-esteem to accept favorable 
evaluations from others may be inter- 
preted in these terms. 

Another class of factors which may 
cause rewards which are greater than 
those previously encountered to pro- 
duce psychological discomfort is as- 
sumed to stem from considerations of 
equity and “distributive justice” (see 
Adams, 1963; Homans, 1961). These 
conceptions reflect the following kinds 
of phenomena. The person feels that 
the amount of effort he has expended 
or the quality of the work he has per- 
formed does not merit an increased 
level of reward because relevant posi- 
tive-reference persons whose perform- 
ances are similar in quality and involve 
equal effort are being rewarded at a 
lower rate. It may also be noted that 

reater than previously encountered re- 
ward levels may be rejected because 
they raise questions as to the credibility 
of the reinforcing agent—they raise 
suspicions as to his possible ulterior 
The recent critiques of dis- 
red by Rosenberg 
Janis and Gilmore (1965) 
this point. Moreover, 

and Gergen (1963) re- 
ort that a too ready reinforcement of 
the correctness of another's opinions 
can, under certain conditions, result in 
an increase in negative affect toward 
the overly compliant 
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tualization. The major purpose of 
such a selective sampling is rather to 
provide examples of the kinds of situ- 
ations for which the present notions 
might be profitably taken into account, 


Studies Involving Negative Discrepan- 

cies 

If normal children may be assumed 
to enter an experiment with a history 
of a fairly high rate of positive social 
reinforcement for well-behaved inter- 
actions with adults, we may expect that 
for such children preexperimental en- 
counters involving isolation, nonsup- 
port, or negative reinforcement will 
create negative affect, and hence trigger 
self-presentations designed to raise the 
rate of social approval being received, 
Since all of the studies to be cited in- 
volve learning tasks that are of a sim- 
ple, nonchallenging nature, it will be 
argued that these self-presentations are 
likely to take the form of task behaviors 
which are likely to result in an incre- 
ment in social reinforcement: that is, 
it is expected that high rate of condi- 
tioning will be found in a social rein- 
forcement situation which immediately 
follows the under-reward experience. 

Thus, the increased Tate of condi- 
tioning to a social reinforcer following 
a period of social isolation found by 
Gewirtz and Baer (1958a, 1958b), 
Erickson (1962), and Lewis and Rich- 
man (1964) all fit the present view 
of the processes underlying social re- 
inforcer effectiveness in certain speci- 
fied situations. The relevance of the 
present approach is also Supported by 
(a) the superior learning found by 
Hartup (1958) following a sudden 
withdrawal of a nurturant relationship, 
(b) the increments in the incentive 
valve of a social reward following a 
negative reinforcement found by Lewis 
and Richman (1964) and (c) the find- 
ing of Walters and Ray (1960) that, 
for a child, being left alone with an 
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adult stranger who ignores him in- 
creases the effectiveness of subsequent 
social reward. Cohen, Greenbaum, and 
Mansson's (1963) finding that, 'even 
with college students, the sudden with- 
drawal of social reinforcement after an 
initial high rate of approval results in 
a more rapid rate of verbal condition- 
ing ina subsequent reinforcement sit- 
uation than occurs for Ss who did not 
experience such a negative discrepancy 
also fits the present assumptions. 

Additional evidence in support of the 
present approach may be found in a 
study by Gilmore and Zigler (1964) 
involving the effects of social reinforce- 
ment and nonreinforcement on the task 
Dersistence of first- and latter-born 
children. Gilmore and Zigler's as- 
sumption that first-born children have 
had past social reinforcement experi- 
ences involving a high rate of social 
approval makes their findings particu- 
larly relevant. These investigators 
found that first-born Ss experienced a 
good deal more discomfort under non- 
reinforcement than positive reinforce- 
ment and manifested behavior which 
can be interpreted as a strategy to rase 
the rate of positive reinforcement they 
Were receiving; that is, they showed 
significantly more persistence on a bor- 
ing task in a nonreinforcement than in 
a reinforcement treatment. This per- 
Sistence may be seen as a tactic of self- 
presentation used by the first-borns to 
inform E that they are good, conscien- 
tious Ss, and therefore should be given 
more social approval, 

The author’s supposition that in the 
above kinds of task situations S's at- 
tempt to validate his SRS may be 8 
more important determinant of his task 
performance than any problem-solving 
Set which is part of the “demand char- 
acteristics" of the conditioning situation 
is directly supported by the following 
observations of Lewis and Richman 
(1964). Working with third-grade 
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elementary school children, they found 
that Ss who had preexperimental ex- 
periences of social isolation or negative 
reinforcement appeared to “. . . be less 
interested in a solution to the problem, 
and more interested in obtaining social 
reinforcement [p. 256].” 


-Ositt E " 
Positive Reward Discrepancies 


Some studies that may be interpreted 
as involving positive S’s SRS will now 
be examined. A number of investiga- 
tors have found that schizophrenics ex- 
Perience discomfort and do not condi- 
tion or perform well when positive-re- 
inforcement procedures are used. A 
recent study hy Berkowitz (1964) is 
Particularly striking in this connection : 


Reaction times of 39 chronic schizophrenics 
ere tested with and 


and 39 normal adults w 
Without reinforcement after exposure to 
Of 3 conditions of contact with E: (a) a0 
Contact, (b) contact with an unresponsive 
and aloof E, and (c) contact with a warm 
and accepting E [p. 522]. 
Contrary to Berkowitz's original hy- 
Pothesis, the schizophrenics in the 
ick and 

ie Some investigators, notably Rodnick ara 
Jarmezy (1957) and their associate ch 
Suggested that social censure ene Ana 
rupts rather than improves the perform: 
i Schizophrenics. The majority © 
n this area, however, do not SUP rS ar 
Position of Rodnick and Garmezy. 5 
een a recent review. article us 
i ited. y 

ang (1965) may be c! i dhis area 


QUt that, taken together, uem ac that the 
- + do not support the hy?" usted by 


e 2 
Di 'rformance of schizophrenics is 51^ jus 


(pS) 5. 
ue g) earlier 
E (1965) dicted 


Positive results have either been co! ated 

Y later results, only weakly corro? re inter- 
ater results, or are open to alterna abe 
Pretations because of methodologic@ geste d 
Ks Furthermore, as has been umber o 
st Ove in the text of this paper, ? i — mild 
veis have demonstrated that Co mezy 
M al censure of the type used facilitated 
ine Rodnick and their stt ene [Buss & 
Ins Performance of schizophrenics 


g, 1965, p. 11]." 
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warm condition had significantly slower 
reaction times than those in the aloof 
and no-contact conditions. A number 
of studies of verbal conditioning have 
found that schizophrenics show better 
conditioning when nonreinforcement or 
negative reinforcement is used (eg., 
Cohen & Cohen, 1960; Leventhal, 
1959). Similarly, Atkinson and Rob- 
inson (1961) and Koppenhaver (1961) 
compared verbal reward and punish- 
ment in a paired-associates learning 
task and found that punishment im- 
roved schizophrenic learning relative 
to that obtained with reward. 

If it may be assumed that schizo- 
hrenics have had a premorbid social 
reinforcement history characterized by 
nonreinforcement and/or negative re- 
inforcement for a variety of behaviors, 
then the above results may be inter- 


preted as follows: Schizophrenics eval- 
uate positive social reinforcement as an 
Thus, 


e state of affairs. 
ning or slow rate of con- 
ditioning found with schizophrenics 
when positive reinforcement was used 
may be seen as a defensive strategy de- 
signed to compel the reinforcing agent 
to produce the kinds of reinforcer 
events with which schizophrenics are 
likely to feel comfortable. That is, the 
schizophrenic who fails to learn un- 
der conditions of positive reinforce- 
ment may be seen as utilizing a self- 
presentation designed to convey the im- 
ression that he is stupid and inept, 
presentation of self which is likely 

nonreinforcement and/or 
forcement necessary to 


RS 


inappropriat 
the poor lear: 


a 
to elicit the 
negative rem 
cre nn may also be adduced to 

Dine "normal" Ss find sub- 
itive discrepancies disturb- 
Ina recent study Marlowe (1 962) 

f r 10 minutes of receiving 
t for making posi- 


100% 1 references his Ss showed a 


decline in the rate of emis- 
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sion of such utterances, with one S 
spontaneously commenting, "it's aw- 
fully hard to sit here and toot your 
own horn [cited in Crowne & Mar- 
lowe, 1964, p. 61]." This lowered 
rate of conditioning may be seen as a 
strategy adopted by Ss to force E to 
lower the rate at which he was reward- 
ing positive self-references, because 
after a while Ss view the high rate of 
reward as inappropriate, Further, an 
experiment by Aronson and Carlsmith 
(1962) demonstrates that even when 
S’s reinforcement history is defined in 
terms of negative-reinforcement en- 
counters experienced in a laboratory 
situation, a sudden confrontation with 
a highly rewarding state of affairs pro- 
duces discomfort and indeed, when 
given the Opportunity, Positive-discrep- 
ancy Ss try to lower the level of their 
performance score, 

Finally, the present author, in an 
exploratory study concerned with the 
effects of degree and direction of re- 
ward discrepancy, found that for Ss 
who initially had been reinforced at a 
33% rate for a particular class of emo- 
tional labeling Tesponses to a facial 
photograph a switch to a 66% rate of 
social reinforcement produced a more 
positive state of affect and a higher 
rate of emission of the reinforced re- 
sponse class than did a switch to a 
100% rate of social reinforcement, 


OPERATIONALIZING THE SRS 


Before closing the present formula- 
tion, a brief discussion of possible 
means of operationalizing the notion of 
a social reinforcement standard seems 
in order. Two widely divergent ap- 
proaches may be used. The first in- 
volves the development of various tech- 
niques to assess Ss’ past histories of 
social reinforcement ; the other involves 
generating a set of experimental para- 
digms for creating in the laboratory 
histories of social reinforcement for be- 
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haviors whose occurrence is not tied 
io any clear expectation of social re- 
ward or punishment. Some examples 
of each of these approaches will be pre- 
sented, and then a paradigm developed 
by the author will be described in some 
detail. 

Any of the many approaches that 
have been used to assess the prior his- 
tory of social reinforcement of Ss could 
be used to infer their SRS given the 
assumption of an isomorphism between 
the SRS and the objective properties 
of Ss’ actual reinforcement histories. 
For example, a number of investiga- 
tors have constructed history-of-rein- 
forcement questionnaires which de- 
Scribe behavioral situations and then 
ask S to choose from a list of parental 
reaction such as kisses, isolation, 
deprivation of privileges, etc. A vari- 
ant on this approach asks S to infer 
how a particular parent felt when S 
engaged in various behaviors (see, e.g. 
Crandall, 1963). Other investigators 
have used objective biographic data. 
Zigler (1961) inferred degree of prior 
deprivation of social reinforcement by 
having judges do a blind coding of 
detailed biographic data obtained about 
S’s early life; for example, information 
was gathered on whether S’s parents 
were separated when he was a child, if 
the parents were ever institutionalized, 
Separated, divorced, ete. Similarly, 
Crandall, Good, and Crandall (1964) 
used school records as indexes of the 
child’s probable history of reinforce- 
ment in everyday achievement experi- 
ences. 

Since Crandall ( 1963) found a sig- 
nificant positive correlation between Ss 
Seneralized expectancies of success 
in her task situation and her ques- 
tionnaire measure of social rein- 
forcement, and Crandall et al. (1964) 
found significant positive correlations 
between the child's actual performance 
in school and his generalized expec- 
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tancy of success, it seems that the as- 
sumption that there is an isomorphism 
between S's actual reinforcement his- 
tory and his standardized construal of 
that history is a reasonable one. Fur- 
thermore, since Crandall (1963) found 
S's Generalized Expectancy (GE) a 
better predictor of present social rein- 
forcement effectiveness than her ques- 
tionnaire measure, it may be possible 
to infer S’s SRS by obtaining expec- 
tancy estimates prior to the perform- 
ance of a given task. For example, Ss 
might be asked to estimate the proba- 
bility of their receiving social approval 
or disapproval for a given behavior in 
a variety of social reinforcement con- 
texts such as with peers and high- and 
low-status others. 

While the assessment approach com- 
plemented with a measure of S's GE 
provides a promising start to finding a 
valid set of procedures to operationalize 
S's SRS, it should be noted that the 
correlations described above, while sig- 
nificant, were sufficiently small (be- 
tween 21 and 32) to leave room fora 
good deal of unaccounted-for variance. 
It is therefore suggested that greater 
variance control might be obtained i 
expectancy estimates were obtained in 
conjunction with the creation of a 
social reinforcement history de novo 
in the laboratory. A general paradigm 
designed to accomplish this has been 
developed by the author an is de- 
Scribed below. 


9 The term Generalized Expectant y fines 


M 1954). 
be traced to Rotter ve the expectancies 


GE as “th UE 
e generalization Of " 
for the same or similar reinforcements à 
Occur i ituation as occur 
in a present situ sctionally 


past situations for the same, o 
related behaviors [p. 166] 2" otter : 
to suggest that the GE effect W! Aon 
more heavily in novel situations than * 
those in which S has had a series 
ences, Tt is this aspect of the 
that makes it particularly use 
Purposes. 
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ExPERIMENT I 
Method 


A verbal conditioning task was used which 
required that S judge the emotional state of 
a person from a facial photograph. S se- 
lected an emotional label from a list of 6— 
love, joy, contentment, anger, fear, disgust. 
The labels represented two general response 
classes: positive and negative emotions. The 
task was presented to S as having a twofold 
purpose: 

1. First, the task was described as an at- 
tempt to validate a universal list of emotional 
categories developed by the experimenter. 
(This instruction was designed to heighten 
the level of mutual interdependence that ex- 
ists in the E-S relationship.) 2. Second, S 
was told that performance on this task is 
a good measure of social sensitivity in the 
real world, and that his responses would be 
compared with norms obtained at the univer- 
sity. (This instruction was designed to in- 
crease the ego involvement of S. The 
pictures used were taken from a past Wayne 
State University yearbook and were selected 
so as to be neutral in expression; that is, 
pictures having obvious cues as to the emo- 
tional state were eliminated. (This was done 
to break down any possible strong stimulus 
binding and therefore to make S's judgments 
particularly susceptible to influence by E's 


reinforcements.) 


The following experimental procedure was 


then instituted. An operant series consisting 
of 15 nonreinforced trials was rum. During 
this series, E showed Ss 15 pictures, one at 
a time, giving them 10 seconds to make a 
The response class (ie, positive 
emotional labels) that was least 

the operant series was desig- 
rect" one for purposes of fu- 
After the operant series 
nd before the reinforcement 
treatments Were given, E obtained Ss' esti- 


mates of how well they expected to do on 


i k (GE measure). 
Hi ips or the building up 


An acquisition serie. ; 1 
of a social reinforcement history period fol- 


erant trials. Ss were randomly 
lowed e Rd her a 100% or a 33% fixed- 
i t-reinforcement condition. 
involved 60 trials— 
5 trials at either of the two 


blocks of 1 1 , 
ue The administration of social 


base-line rates. 


reinforcement was sl 
“that’s good” or “that's fine” for the correct 


response on one OF the other of the above 
schedules. After the acquisition series was 
completed, Ss were asked to estimate (a) 
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TABLE 1 
SHIFT SERIES 


Acquisition 100% 66% 33% 
rate 


100% 
33% 


the proportion of judgments on which they 
thought they actually were correct, and (b) 
the proportion they expected to get correct 
on future trials. 

The acquisition series was followed by a 
shift series involving 4 more blocks of 15 
trials each. During the shift phase, each of 
the original base-line groups was broken into 
3 subgroups generating the 2x 3 factorial 
design shown in Table 1, 

Following the shift series, Ss were once 
more asked to estimate how well they ac- 
tually thought they did and how well they 
expected to do on future trials. An extinc- 
tion series consisting of 4 blocks of 15 non- 
reinforced trials was then instituted. After 
the extinction series was completed, Ss were 
asked to make a final estimate of how well 
they thought they actually did. (A control 
group involving Ss making judgments for 
180 trials without any reinforcement was not 
included in the original pilot study, but is 
being run as part of a larger replication.) 

The conditioning phase was followed by 
an assessment period during which question- 
naire and interview data were collected. In- 
formation was gathered concerning S's affec- 
tive state during the experiment, periods of 
possible shifts in mood, and his general and 
specific evaluation of the experiment, E, and 
the task. (For the most part a semantic- 
differential format involving lists of bipolar 
adjectives was used to obtain this informa- 
tion.) Data were also collected as to S's 
perception of how ego involved he was, how 
much he wanted to obtain E's approval, and 
his notions about the purposes of the ex. 
periment. A detailed “awareness interview” 
patterned after those used by Dulany (1962) 
and Spielberger (1962) completed the ex. 
periment.? 


7 Although a sample of only 36 female sub- 
jects (6 in a cell) was used in the pilot 
study, some interesting, if tentative, trends 
emerged. First, for affective arousal and 
rate of conditioning it is clear that both the 
direction and magnitude of discrepancy from 
a reinforcement base line must be taken into 


SUMMARY AND CONCLUSIONS 


The present attempt to conceptualize 
the interpersonal processes underlying 
the postulated relationship hetween S’s 
Prior history of social reinforcement 
and his present receptivity to social 
reinforcement may be summarized as 
follows : 

1. The individual's past social re- 
inforcement experiences produce an 
internal norm or standard against 
Which the adequacy of present social 
reinforcements is judged. 

2. The individual's construal of such 
experiences (i.e., his standard of social 
reinforcement) is assumed to be essen- 
tially isomorphic with the objective 
properties of his past history of social 
reinforcement, 


3. The affective consequences of 
an individual's Teactions to present 
Social reinforcement are assumed to 
be a complex function of both the 
direction and magnitude of any dis- 
parity that may exist between the pres- 
ent rate of social reinforcement and the 
individual's SRS, 


4. Disparities which produce nega- 
tive affect (ie, rates of social re- 
inforcement Which are either sub- 

i coe 
account. Thus, while both conditions of 
moderate discrepancy (100%-66%, 33%- 
66%) produced less negative affect than the 
conditions of substantial discrepancy (100%- 
33%, 33%-100%), in every case the nega- 
tive-discrepancy conditions produced more 
negative affect than a Positive discrepancy © 
identical magnitude, The conditioning data 
yielded similar results. The conditions of 
moderate discrepancy (100%-66%, 33%- 
66%) each produced a higher rate of con- 
ditioning than the conditions of substantial 
discrepancy (100%-33%, 33%-100%). Fur- 
thermore, as with the affect data, each of 
the Positive-discrepancy treatments produce 
a higher rate of conditioning than did the 
negative-discrepancy treatments of equal 
magnitude. A more detailed report of the 
data yielded by this paradigm will be de- 
erred until the present replication, which 15 
utilizing à larger sample, is completed. 


* 
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oe above or below the SRS) 
an ssumed to result in the individ- 
iem : arying his self-presentation in 
oon a ay as to induce the reinforcing 
Sinh. reward him ata rate which he 
rate ar EE appropriate, that is, at a 
atel hich either matches or moder- 
y exceeds his SRS. 
"s When the context of evalua- 
is E vague, and the learning task 
ture "e simple, nonchallenging na- 
the A Bese seli-presentations may take 
iens orm of variations In task per- 
iainet fr whose direction is deter- 
form by S's assessment of what per- 
"mances are likely to allow him to 
mn Control over the social reward re- 
Sponsivity of the reinforcing agent. 
UN performance in such situations 
d ay be viewed as a defensive strategy 
ae ened to cajole E into reinforcing S 
t à rate of social reward with which 
e comfortable. 
Sut hoped that by adopting the per 
ive of S, we have made some case 
dis value of treating social ren’ 
inier effects as the result of ipm 
vol action between two equally 
Ved, if not equally powerful, actors. 
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or both. The A-B, A-C paradigm was employed, with A-B be- 


ing varied from 12-80 trials, and was learned either by massed practice 


The orderly development of a de- 
tailed interference theory of forgetting 
has been disrupted in a most disorderly 
but convincing manner: The data are 
not supporting certain predictions from 
the theory. The failures are centered 
primarily in the theory as extended to 
account for forgetting from extraex- 
perimental sources of interference (Un- 
derwood & Postman, 1960), and these 
failures are responsible for the present 
paper. A hypothesis to account for the 
failures has been developed, and an 
experimental test has been made of it, 
Some background is necessary in order 
to understand the origin of the hypoth- 
esis; more particularly, it seems neces- 
sary to state explicitly how the inter- 
ference theory has been extended to 
incorporate extraexperimental sources 
of interference. 

The general theory that forgetting 
results from interference among asso- 
ciations was developed primarily as a 
result of studies of retroactive inhibi- 


1 This work was supported by Contract 
Nonr-1228 (15), Project NR 154-057, be- 
tween Northwestern University and the Of- 
fice of Naval Research. Reproduction in 
whole or in part is permitted for any purpose 
by the United States Government. 


Recall of A-C after 24 hrs. 


A-B resulted in only small proactive inhibi- 


PI. Degree of A-B 


learning. Some 


tion (RI), in which the interference 
originated from interpolated associa- 
tions. In relatively recent years the 
discovery of proactive inhibition (PI) 
required extension of sources of inter- 
ference to include associations learned 
prior to the learning of the task to be 
recalled. From the results of studies 
of RI and PI, it seemed necessary to 
add additional mechanisms to account 
for differences in forgetting observed 
under various conditions, thereby ac- 
counting for variations in the magni- 
tude of the interference, Two mecha- 
nisms were added, namely, extinction 
of associations When successive lists 
involved conflicting associations (as in 
A-B, A-C), and the spontaneous re- 
covery over time of extinguished asso- 
ciations. ^ Some such mechanisms 
Seemed necessary to account for cer- 
tain results. For example, the increase 
in PI with increases in the length of 
the retention interval could be handled 
by the recovery of extinguished asso- 
Ciations in the first list which inter- 
fered increasingly with the recall of 
second-list associations, 

The extension to account for forget- 
ting from extraexperimental sources of 
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interference was done with complete 
fidelity to the theory as developed in 
the formal RI and PI paradigms. 
When a naive subject (S) learned a 
single list, it was presumed that the 
associations required conflicted to some 
extent with linguistic associations 
learned previously outside the labora- 
tory. "Therefore, learning of the list 
was assumed to produce some extinc- 
tion of the preexperimental associations 
which, with the passage of time, how- 
ever, recovered to produce interference 
(PI) at recall. The role of RI was 
minimized on the grounds that acqui- 
sition of conflicting associations during 
the retention interval was much less 
probable than having already acquired 
such associations prior to learning the 
list. 

The theory further specified. what 
conditions should produce variations in 
the amount of PI. Again, the trans- 
lation was made from the formal RI 
and PI situation. The greater the con- 
flict was between associations, the 
greater the RI and PI (eg- A-B, A-C 


versus A-B, C-D). Variations in num- 


ber and strength of conflicting asso- 
ciations were spelled out in terms of 


unit-sequence interference and letter- 
sequence interference, which in turn led 
concerning dif- 


to specific predictions d 
ferential rates of forgetting of. lists 
constructed to minimize or maximize 
interference with linguistic habits. 

It is not necessary to recite the rather 
agonizingly long list of experiments 
using different materials and drea 
types of tasks, all of which seeme n 
fit rather precisely the specifies o 
the theory, but which failed to show 
the predicted differences in a ps 
(eg., Ekstrand & Underwood, x 
Postman, 1961; Underwood & Keppel, 
1963). Thus, the theory which could 
claim considerable success in interpret- 
ing the results of the formal RI and PI 


paradigms was not supported and was, 


furthermore, left relatively impotent to 
account for the invariant amount of 
forgetting which did occur in the sin- 
gle-list experiments. 

In any reevaluation of a theory that 
is reasonably successful in one context 
but not in another, several alternatives 
are always available. However, since 
the theory originated in the formal RI 
and PI situations, a reexamination of 
the effect of certain variables in these 
paradigms looked potentially fruitful. 
More particularly, our interest focused 
on the strength of interfering associa- 
tions in the PI paradigm and the rela- 
tionship between this strength and the 
amount of PI. It was assumed, in the 
unsuccessful tests of the theory, that 
linguistic habits of long standing will 
be extinguished by relatively few trials 
on a list where the associations conflict 
with the linguistic habits. This as- 
sumption may be invalid. Perhaps, as 
Mandler (1962) has argued, high levels 
of learning of associations result in 
these associations taking on a charac- 
teristic (structure) which prevents 
them from interfering with other asso- 
ciations. No study is available on the 
relationship between PI and degree of 

hen the prior learning 


prior learning w 
has been taken to a very high level and 
where the retention interval is rela- 
tively long. 
In view of these considerations, the 
experiment was an attempt to simulate 
the effect on PI of associations which 
had experimentally been given charac- 
d to be possessed by 


teristics presume 

linguistic habits. Two such character- 
istics seemed potentially important. 
First, as noted, it must be assumed that 
linguistic habits are very strong and 
second, that linguistic habits have been 
developed by distributed practice over 
many years. Either or both of these 
characteristics might produce a reduc- 
tion in PI despite the fact that formally 
the linguistic associations and those 
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learned in the laboratory would ap- 
pear to be in conflict (A-B, A-C). In 
the experiment, therefore, the degree 
of A-B learning has been varied from 
a low to an extremely high level. In 
one set of conditions the level of pre- 
scribed learning was given in a single 
session. In another set, the learning of 
A-B was distributed over 4 days. All 
Ss learned a second A-C list which was 
recalled after 24 hours. 

The thinking which prompted the 
experiment may now be summarized 
and the implications of several possible 
outcomes evaluated. It was noted that 
past experiments have been unsuccess- 
ful in demonstrating differential PI 
when associations learned in the lab- 
oratory are in differential conflict with 
extraexperimental linguistic associa- 
tions. This is to say, the linguistic 
associations do not seem to “behave” 
in the same way as do associations 
established in the laboratory, It has 
been suggested that two quantitative 
differences between extraexperimental 
associations and laboratory associations 
may be involved in the discrepancy ; 
namely, extraexperimental associations 
have been learned to a very high level, 
and they have been learned by dis- 
tributed practice. The experiment, 
therefore, manipulated these two vari- 
ables for A-B in a formal A-B, A-C 
paradigm in an attempt to have A-B 
simulate the extraexperimental asso- 
ciations. The influence of these two 
variables on the recall of A-C is ex. 
amined, and, more particularly, it is 
asked if either of these variables js 
associated with a decrease in PI. Four 
relatively “pure” alternative outcomes 
of the experiment may be identified. 

1. If degree of learning of A-B is 
the only critical factor, PI of A-C must 
decrease as degree of A-B learning in- 
creases, and the amount of decrease in 
PI must be independent of massing 
and distribution of A-B. 
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2. If the distribution of practice on 
A-B is the only critical factor, the 
reduction in PI resulting from distrib- 
uted practice will be independent of the 
degree of learning of A-B. 

3. If both variables are involved in 
a reduction in PI, maximum reduction 
should occur when A-B is learned to 
a high level by distributed practice if 
the relationships are monotonic. There 
may or may not be an interaction of 
the effects of the two variables. 

4. Finally, neither variable may in- 
fluence PI, in which case other sources 
must be examined for an understand- 
ing of why the interference theory has 
not been supported when applied to 
extraexperimental associations. 


THE EXPERIMENT 
Method 


Lists. The two lists consisted of 12 paired 
associates in which the stimulus terms were 
three-letter words (medium frequency), and 
the response terms were two-syllable adjec- 
tives taken from Haagen’s (1949) list. The 
lists formed an A-B, A-C paradigm, and the 
order of learning the lists was the same 
for all Ss. 

Procedure. The conditions may be thought 
of as representing two experiments, one in 
which the A-B learning was massed (MP), 
the other in which it was distributed (DP)- 
There were four degrees of A-B learning 
in each experiment—12, 32, 48, or 80 trials. 
For the MP experiment the trials were all 
given in a single session and were imme- 
diately followed by the learning of A-C to 
one perfect trial. In the DP experiment, the 
A-B trials for each degree of learning were 
equally divided over 4 days, Thus, with 12 
DP, S had 3 trials on each of 4 successive 
days; with 80 DP, S had 20 trials on each 
of 4 successive days. On the fourth day, 
following A-B learning, A-C was learned to 
one perfect trial. 

Recall of A-C took place after 24 hours. 

relearned for 5 trials or the number be- 
yond this required to reach a perfect trial. 
Finally, a free-recall test was given. SS 
were supplied a sheet of paper on which the 
stimuli were listed, with two blank spaces 
under each, and were asked to supply the 
response terms from both lists and to identify 
(first or Second) the list to which each re- 
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sponse belo: imi i s : 
up cae CN time was given 
Mice was by the anticipation method 
zd ghout, the rate being 2:2 seconds, with 
: -second intertrial interval. The words 
ppeared in four different orders. The eight 
eg of college students were composed of 
ona b Pep: all Ss had served in at least 
Sesh erbal learning experiment prior to the 
REDE one. The assignment of Ss to the 
tiis 1-5 was random. dt was recognized 
le the groups having high degrees of A-B 
arning, particularly the 48-MP and 80-MP 
ED could become extremely bored with 
dis ask. Therefore special instructions were 
deno which emphasized the importance of 
fon SE to respond correctly on each repe- 
oe of the list. | The 48-MP and 80-MP 
FEM were promised an extra experimental 
all (toward the requirement imposed on 
students in elementary psychology) for 
un cooperation, However, no S was in- 
Re as to the number of trials he would 
80 given. A S-minute rest Was given the 
-MP Ss after the first 40 trials on A~- 


RESULTS 
A-B Learning 


All eight groups h 
trials on A-B under the same condi- 
tons ; therefore, performance on these 
trials may be used to establish the com- 
parability of the groups. The mean 
total-correct responses varied from 
18.40-22.25, and all Fs were less than 1. 

It was of some importance that S 
continued to anticipate correctly in the 
overlearning trials if the differences in 
degree of learning are to be effective. 
The average number of times each item 
Was given correctly was calculated. 
For the MP groups the values were 
10.00, 29.48, 44.60, and 7620 for 12, 
32, 48, and 80 trials, respectively- 
Thus, out of a possible total of 30, 
the average pair was correctly antici- 
pated 76.20 times for the MP-80 con- 


dition. The corresponding values for 
the DP groups were 9,68, 28.98, 45.24, 
and 77.32. For the higher degrees of 


learnin s 
g erformance 97 i 
been a eine at a slightly higher 
level for the DP 8r? 


ad the first three 


ups than for the 


CORRECT ON 
FIRST THREE TRIALS 


TRIALS TO CRITERION 


TRIALS ON A-B 


Fic. 1. A-C learning as a function of 
degree of A-B learning. (The upper panel 
gives the mean number of 
on the first three trials, 
mean number of trials to 


correct responses 
the bottom panel, 
one perfect trial.) 


This may be shown in an- 
other way. The percentage of trials 
on which all items were correct was 
calculated after the trial on which all 
items had been first correctly antici- 
The values for MP 48 and MP 
7 and 81.6%, and for the 


80 were 79.196 
corresponding DP groups, 89.796 and 


90.356. 


A-C Learning 

Two measures of second-list learn- 
ing (A-C)are plotted in Figure 1. The 
upper panel shows the mean number 
of correct responses on the first three 
trials, the lower panel, the mean num- 
ber of trials to criterion. A statistical 


analysis of the data plotted in the upper 


anel shows nO significant effect of 


either degree of learning or of MP-DP. 
The F for degree of learning is less 
than 1 (as is the interaction), that for 
MP-DP, 1.83, which is far from sig- 
nificant with df = 1/152. Analysis of 


MP groups. 
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the trials to criterion also fails to result 
in any variable being significant. The 
only F above 1 results from degree of 
learning (2.27), but with df = 3/152, 
an F of 2.67 is required for the 5% 
level. A trend analysis for unequal 
intervals (Gaito, 1965) still produced 
no significant component. It must be 
concluded that the wide variation in 
degree of A-B learning had no appre- 
ciable differential effect on A-C learn- 
ing. 

Certain considerations (e.g., Mand- 
ler, 1962) might lead to the expecta- 
tion that performance on A-C would 
improve with the higher levels of learn- 
ing of A-B. As noted, no statistical 
support of any change in A-C learning 
between 12 and 80 trials of A-B learn- 
ing has been found. There is no con- 
trol, such as A-B, C-D, to determine 
whether or not the transfer is positive 
or negative, However, if such a con- 


MEAN NUMBER RECALLED 


70 


60 


50 


PERCENT Loss 


40 


12 32 48 80 


TRIALS ON A-B 


Fic. 2. Raw recall scores (upper panel) 
and adjusted percentage of loss scores 
(lower panel) as a function of degree of 
A-B learning. 
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trol were used, to produce improved 
performance in A-C with higher de- 
grees of A-B learning would require 
that the C-D control show increasingly 
poorer performance as the number of 
trials on A-B increases. Available 
data (eg. Postman, 1962) suggest 
that this would not be found. 

It is known from another study (un- 
published) that the two lists do not 
differ in difficulty. Tt was noted earlier 
that on the first three trials of A-B, 
performance varied between 18.40 and 
22.25 correct responses. These are at 
best only slightly higher than the values 
seen in the upper panel of Figure 1. 
Across all conditions, the mean num- 
bers of trials to reach one perfect on 
A-B were 7.06 for MP, 748 for DP. 
The corresponding values for A-C were 
6.93 and 7.46. It should also be 
pointed out that if differences in learn- 
ing to learn are associated with dif- 
ferences in degree of A-B learning, 
A-C performance should be increas- 
ingly facilitated by this factor alone. 
In short, no evidence is found of any 
marked transfer effects as a function of 
degree of A-B learning. 

Finally, it may be noted that some 
intrusions of B responses occurred in 
learning A-C. Their numbers de- 
creased as A-B learning increased, be- 
ing 18, 15, 5, and 2 for MP: 11, 10, 
6, and 1 for DP 


Recall 


Two recall measures are plotted in 
Figure 2; the first to be considered 
will be the upper panel where the raw 
recall scores are plotted, An analysis 
of variance shows only one significant 
F, that for DP-MP (4923). The F 
for degree of learning was 1.27, an 
a trend analysis does not change the 
Conclusions. An analysis on the scores 
for the four MP groups alone pro- 
duces an F of 6.30 for the quadratic 
trend with a value of 3.97 required for 
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TABLE 1 
Errors oN THE RECALL TRIAL 


Trials on A-B 
Errors — aT m 
12 | 32 | 48 | 80 
Total 
MP groups 81 |48 | 35 |21 
Intrusions from 
A-B 47 | 43 | 27 |18 
% intrusions 2.2| $9.6) 77.1/85.7 
Total 
DP groups 28 |15 | 15 |10 
Intrusions from 
A-B 17 5 913 
% intrusions 60.7} 33.3] 60.0/30.0 
The 


the 5% level of significance. 
linear (F = 1.22) and cubic (F= 
trends were not significant. 
There were small but insignificant 
differences in rate of learning A- 
The direction of the differences, how- 
ever, suggested that recall differences 
might be underestimated somewhat, 
Particularly for the MP groups: There- 
fore, a multiple-entry probability analy- 
sis was applied (Underwood, 64) 
and percentage of loss scores derived 
for each S. The means of the loss 
Scores are plotted in the lower panel 
of Figure 2. With the loss scores, the 
quadratic component of the degree-of- 
learning effect produces an F of 4.06 
(df = 1/152), with an F of 391 re- 
quired for the 5% level of significance. 
The quadratic interaction, | however, 
falls considerably short of significance 
(F = 2.80). Nevertheless, any Way - 
viewing the plots in Figure 2 must lea 


to the conclusion that recall is pe 
tia i d by differences In °° 
lly uninfluence y wron 


gree of learning fo 

but is gea for the MP groups. 
Plots of percentage of — a 

function of number of times A- iw 

given correctly in learning ier | I 

the DP groups have higher recall tha 


21) 
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the MP groups at all levels of response 
strength. 

An analysis of trials to relearn A-C 
to one perfect recitation produced no 
significant source of variance, although 
the DP groups overall were somewhat 
superior to the MP groups (4.01 ver- 
sus 4.61 trials). 

Table 1 presents an analysis of er- 
rors on the recall trial. For both MP 
and DP, total errors decrease as de- 
gree of A-B learning increases. For 
the MP groups, the percentage of the 
errors which were intrusions varies 
from 77.1%-92.2%, with the percent- 
ages for the DP groups being appre- 


ciably lower. 


Free Recall 

For this measure the interest 1s in 
the recall of A-B following the learn- 
ing and, after 24 hours, relearning of 
A-C. A lenient scoring system Te- 
quires only that S produce the responses 
from A-B; a stringent scoring system 
requires that he also pair the response 
terms with the correct stimuli and cor- 
rectly identify the pairs as to list. The 
results for both scoring systems are 
plotted in Figure 3. The DP groups, 
by either scoring method, show nearly 
perfect recall. By the stringent scor- 


9 ———— LENIENT 
—— STRINGENT 


MEAN NUMBER CORRECT 
5 


80 


32 
TRIALS ON A-B 
. 3. Free recall of A-B for MP and 
get as a function of degree of A-B 
learning. 


12 
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ing system, only four, five, four, and 
four Ss in the successive groups failed 
to show perfect recall. The near-per- 
fect recall in the DP-12 conditions is 
particularly impressive when it is real- 
ized that two Ss had failed to reach a 
perfect trial within 12 trials. The MP 
groups clearly show a loss, of approxi- 
mately two items, but there is no dif- 
ference as a function of degree of A-B 
learning (F = 2.39). There was not 
an appropriate control group to deter- 
mine if the loss shown by the MP 
groups can be considered to represent 
significant extinction (significant RI). 
However, some estimates are possible, 
The DP groups on A-B learning pro- 
vide evidence on 24-hour paced recall 
with no interpolated list after the in- 
itial blocks of trials, that is, after 3, 8, 
12, and 20 trials on A-B. The recall 
scores for these four groups were 6.70, 
8.50, 10.00, 10.55, As can be seen 
in Figure 3, free recall of A-B after 
12 trials was approximately 9 items 
(versus 10 at the paced recall). With 
unpaced recall and no interpolated 
(A-C) learning it is suspected that the 
Scores would probably be perfect with 
32 or more trials, Tt is concluded that 


all MP groups show significant extinc- 
tion effects, 


IMPLICATIONS 


Four relatively unambiguous possi- 
ble outcomes of the experiment were 
identified in the introduction, As 
might be expected, the results do not 
match precisely any one of the four, 
Nevertheless, the second alternative, 
which stated that DP of A-B is the 
only critical factor involved in the re- 
duction of PI, is reasonably accurate, 
The difference in the amount of PI for 
the DP and MP conditions is large and 
holds for all levels of A-B learning. 
If we accept the assumptions that (a) 
extraexperimental linguistic associa- 
tions have been learned by DP, and 
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(b) present DP conditions have ade- 
quately simulated this learning, pre- 
vious failures to support the extension 
of interference theory to linguistic as- 
sociations become more understandable. 
This finding does not absolve the the- 
ory; rather, it points to a deficiency, 
since the theory has no explicit provi- 
sions to account for the reduction in 
interference produced by distributed 
practice on the interfering list. This 
matter will be examined at a later point. 

The effect of different degrees of 
learning of A-B on PI was somewhat 
ambiguous statistically, yet certain con- 
clusions seem appropriate. There was 
no appreciable effect of different de- 
grees of A-B learning under the DP 
schedule; recall following 80 A-B trials 
distributed over 4 days was essentially 
equivalent to the recall following 12 
A-B trials distributed over 4 days. For 
the MP groups, however, degree of 
learning did influence recall. Recall 
decreased between 12 and 48 A-B 
trials, with a small increase apparent 
betwen 48 and 80 trials. Therefore, 
it is tentatively concluded that degree 
of A-B learning as such is not criti- 
cally involved in the failure of extra- 
experimental associations to conform 
to expectations from interference 
theory. 

Returning to the results for the DP 
groups, the question is whether or not 
there was any PI. That the outcome 
of the DP conditions could make the 
need for a control group (having nO 
prior learning) so important was ufi 
foreseen, Unfortunately, when the 
need became apparent, there was 1° 
way to obtain the control Ss with back- 
ground experience in other verba 
learning experiments that would dupli- 
Cate the experience of Ss in the D d 
groups. All Ss in the DP groups ha 
Served in at least one previous experi- 
ment employing paired associates i# 
which three-letter words were used, an 
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most had served in two or more. The 
previous experiments in most instances 
had been conducted in the same room 
with the same memory drum by the 
same experimenters as the present one. 
Therefore, some proactive interference 
from these prior experiments must be 
expected, and this interference would 
have been present had a control condi- 
tion been run. When a naive control 
group is used, approximately 75% re- 
call may be expected after 24 hours if 
the list is learned to one perfect recita- 
tion (Underwood, 1957). The question 
is, of course, how much this value would 
be reduced had a control been run con- 
sisting of Ss with the same background 
of experience in verbal learning studies 


as those serving in the present condi- 
tions. Across all DP groups, recall 
ems) 


was 56%, using 12 (number of it 
as a base. We cannot be sure, of 
course, but we do not see how a con- 
trol group could have recalled more 
than 65%, and perhaps 60% is more 
accurate, In any event, if there 1S 

shown in the recall of the DP groups, 
it is quite unlikely to be more than 
10% in amount. Again the implica- 
tions of the conclusion that there is 
little if any PI shown by the DP 
groups may be noted. If it 1s assumed 
that extraexperimental linguistic asso- 
ciations have been learned with dis- 
tributed intervals far more extreme 
than those used here, it cannot be ex- 
pected that such associations will inter- 


fere at recall with associations acquired 


in a few trials in the laboratory gn 


though the associations learned in d 
laboratory are ostensibly in contie 
(A-B, A-C) with the extraexperimen- 


tal associations. 
iti uestion is why DP on 
The critical d ediy reduces 


the interfering list mar : 
the interference produced by that list, 


whereas MP does not. No complete 
answer can be given at the present 
time, but there are certain contingen- 
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cies in the data which may point to the 
critical mechanisms. The free-recall 
data for the DP groups indicated es- 
sentially no extinction of the A-B as- 
sociations, even with the lowest degree 
of prior learning, whereas there was 
clear evidence for extinction in the MP 
groups. Yet, the A-C learning of the 
MP and DP groups did not differ as 
might be expected if extinction differed. 
That the DP groups showed no extinc- 
tion is particularly impressive in view 
of the fact that there were two poten- 
tial extinction periods (learning and 
relearning of A-C). There was no 
extinction in spite of the fact that in- 
trusions from A-B were observed both 
in learning and relearning A-C, and 
intrusions have sometimes been asso- 
ciated theoretically with the process of 
extinction. The failure of extinction, 
therefore, is associated with a reduc- 
tion in PI. It is as if potentially inter- 
fering associations learned by DP do 
not enter into an extinctive relationship 
in the same way that associations 
learned by MP do. Interference the- 
ory has assumed that spontaneous Te- 
covery (increasing strength) of the 
A-B associations is in part at least 
responsible for an increase in PI as 
the retention interval increases. Yet, 
the DP groups maintained the interfer- 
ing associations at a high level of 
strength, but still there was little inter- 
ference at recall. It is possible that 
the critical role of DP could still be 
related to extinction effects which re- 
duce PI. Peripheral or weak extra- 
experimental associations tied to the A 
terms may be thoroughly extinguished 
by the DP schedule (Keppel, 1964), 
thereby leaving only the A-B associa- 
tion as a potential source of interfer- 
ence for A-C. Under such conditions, 
it may be possible to maintain the dif- 
ferentiation of the two response sys- 
tems at recall. The use of other para- 
digms without common terms (eg.. 
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A-B, C-D) would give evidence on the 
appropriateness of such an approach 
for revisions of the interference theory, 
revisions which seem necessary. 

Although a direction has been sug- 
gested which may be taken by inter- 
ference theory to accommodate the 
present findings, it is not intended that 
this be further pursued here, How- 
ever, partly by way of summary and 
partly as a means of extending the im- 
plications of the present results, fur- 
ther comments will be made: 

1. The interference theory of forget- 
ting has been extended to include in- 
terference produced by associations 
learned outside the laboratory, particu- 
larly associations learned prior to lab- 
oratory learning. The basic purpose 
of this extension was to allow an ac- 
counting of the forgetting shown by a 
naive S learning a single list in the 
laboratory. As noted in the introduc- 
tion, tests of the theory have been 
rather uniformly negative in their at- 
tempts to show that materials differing 
in the degree to which they conflict 
with associations acquired outside the 
laboratory will be forgotten at different 
rates. The amount forgotten by a 
naive S seems essentially independent 
of the nature of the material and the 
type of task and is sensitive only to 
the degree of learning. The present 
results strongly suggest that the source 
of interference cannot be linguistic 
habits of long standing, or, if they are, 
our methods of “bringing them into 
the laboratory” are quite inadequate, 
If we cannot identify specific sources 
of interference to account for the for- 
getting of the naive S, we are left with- 
out an adequate accounting of the for- 
getting shown. When we attempt in 
the future to identify such interference, 
it appears that we must make it inde- 
pendent of the nature of the material 
learned in the laboratory ; the relatively 
constant rate of forgetting of the naive 
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S across widely different materials and 
tasks does not provide an alternative. 
Until we are able to identify the source 
or sources of interference producing 

the constant forgetting in the naive S, 
the decay theorist has some grounds 
for glee. 

2. When an interfering list is dis- 
tributed over days, the interference is 
markedly reduced. This cannot result 
from a difference in the strength of 
association as we normally think of 
Strength, for clearly the 80-trial DP 
condition must have resulted in as- 
sociations of greater strength than in 
the 12-trial DP condition, but still 
PI did not differ. When learned 
under MP, very Strong associations 
still apparently interfered with the 
retention of A-C, The DP condi- 
tions not only resulted in fewer in- 
trusions in learning and recall than 
did the MP conditions, but total errors 
at recall were quite different in fre- 
quency. One of the continuing prob- 
lems of interference theory is “what to 
make" of errors, particularly intru- 
Sions. Are they to be taken as indica- 
tive of causal processes for forgetting, 
or are they merely the consequence of 
forgetting ? 

_3. There have been recent discus- 
sions in the literature concerning the 
effect of overlearning on transfer and 
forgetting (Jung, 1965; Mandler, 
1962, 1965), significant effect on 
A-C learning as a consequence of dif- 
ferences in degree of A-B learning was 
not found. How can an A-B, C-D 
control provide data which, when the 
difference between C-D and A-C learn- 
ing is examined, result in better A-C 
Performance with high degrees of A-B 
learning than with low? The recall 
Tesults for the DP groups gave no indi- 
cation of differences as a function of 
degree of A-B learning. The only data 
which would tend to support a concept 
of structure in overlearning would be 


è the small reduction in PI with the MP 
TA between 48 and 80 A-B trials. 
oar ipie the large reduction in 
sui iig for the DP groups might be 
^ to be in some way produced by 
by me developed in A-B learning 
devel and that this structure does not 
op in learning by MP. 
dapi present data show that widely 
red ibuted practice of A-B markedly 
uces the forgetting of A-C. Com- 
ade operations on A-C have also 
E shown to reduce markedly the 
» Setting of A-C ( Keppel, 1964). If 
! mn is widely distributed on both 
urth and A-C, will forgetting be still 
ther reduced, or is it the contrast 
ssuierentiation ?) between. practice 
eff €dules which makes distribution 50 
ective (Houston & Reynolds, 1965) ? 
hese problems, it seems, are some 
dut more important ones which 
ere be handled, not only by an inter 
nce theory, but by any other type 
is Lh. Some of the shortcom- 
Ory Tow the details of interference the- 
ing ave become gradually disconcert: 
3 nt the past few years; the presen 
these has merely verified certain js 
ma € shortcomings in a rather star’ 
nner, 


of 
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OPTICAL MOTIONS AND SPACE PERCEPTION: 
AN EXTENSION OF GIBSON'S ANALYSIS! 


JOHN C. HAY 
Smith College 


There isa perceptual tendency to see the 2-dimensional transformations 


of shadows as rigid movements in 3 dimensions. 


this illusion for veridical perception is investigated by a mathematical 


analysis of the correspondence betw 
ments and the optical motions they 
that optically equivalent object disp 
and that therefore the optical moti: 


ons they produce are potentially in- 
Some of the objective features speci- 


fied by optical motions are initial orientation, rotation, and the ratio 


between initial distance and translation distance. 


The analysis is based 


on coplanar sets of object points, and the fact that their displacements 
produce 2-dimensional projective transformations of the retinal image. 


The sharply defined shadow of a 
stationary Object, cast by a point 
source of light, may assume a variety 
of two- and three-dimensional appear- 
ances, like the black-and-white sil- 
houettes of Gestalt Psychology. But 
the transforming shadow of a moving 
object induces a more striking illu- 
sion: The shadow acquires a sub- 
stantial appearance, and looks to be 
itself an object moving in depth. 
(Metzger, 1934; Wallach, 1953) The 
situation is illustrated in cross section 
in Figure 1A. It is as if the visual 
system, when presented with the 
transformation of the shadow, re- 
sponded by resolving it into two 
components: a constant object struc- 
ture and a change of position in space. 

Gibson (1957) inferred from this 
phenomenon that the shadow pre- 
serves information about the object, 
not in its form, but in its transforma. 


1 This study is based on a doctoral thesis 
presented to Cornell University. _ Much of 
the work was supported by a United States 
Public Health Service Predoctoral Fellow- 
ship. J. E. Hochberg and J. J. Gibson pro- 
vided invaluable counsel, and Louise S. Hay 
and E. Wong checked the original mathe- 
matical treatment. 


tion. “Information” is relative, ol 
course, to a particular range of possi- 
bilities. Gibson proposed that the 
shadow transformation is i) 
tive with respect to the full range 0 
object displacements (rigid motions) i 
that is, rotation around, and transla- 
tion along, each of the three dimen- 
sions of space. He noted, first of all, 
that the optical motion induced at the 
eye by the shadow is the same as 
would be induced by the object if the 
latter were viewed from a suitable 
station point. In developing his 
analysis, Gibson concentrated on the 
case of flat object surfaces, By 
watching the shadow transformations 
caused by displacing flat objects: 
he inferred the following correspond- 
ence between object displacements 
and optical motions: 


Object Optical 
Displacement Motion 
Translation 


1. along line-of- — Magnification. 
sight. 
2. along lateral —> Translation 
(or vertical) axis, along lateral Ci ^ 
vertical) axis. 
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Rotation 
3. around line- — Rotation 
of-sight. around line-of- 
sight. 
4. around lateral Foreshortening 
(or vertical) along lateral (or 
axis. vertical) axis. 


Gibson hypothesized that the visual 
system is able to invert the above 
correspondence, and thereby identify 
object displacements from the optical 
motions registered at the eye. He 
found experimental evidence for this 
hypothesis by showing that sub- 
jects make certain kinds of judgments 
accurately by watching certain sim- 
plified shadow motions (Gibson & 
Gibson, 1957 ; Gibson, Gibson, Smith, 
& Flock, 1959; Schiff, Caviness, & 
Gibson, 1962). 

The present study € 
analysis by a mathe 
lation of the correspondence between 
object displacements and optical 
motions. The results show that the 
above one-to-one correspondence is 
an oversimplification, and at the 
same time an underestimate of the 
potential informativeness of optical 
motions. There are eight relevant 
parameters of optical motion rather 
than the six in Gibson's list, and these 
parameters may specify the initial 
position of the displaced object, as 
well as the displacement. 


xtends Gibson’s 
matical formu- 


PLAN OF THE ANALYSIS 
be organized 


The size of these "op 


classes determines thé m 
that might be mediated by optica 
is will be similar 


motions. The analysis Wi 


t ided b 
p het pene d The emphasis 


for stationary object. D f 
will be, however; not on the size © 


formation wi 


Moving 
Object 


Point- 
source 
Lamp 


Optical 


Fic. 1. A: Cross section of the moving 
shadow illusion. (The deforming shadow in 
the shadow screen produces the impression 
of a rotating object similar to the unseen 
shadow-casting object.) 

B: Objects undergoing optically equivalent 
displacements in a two-dimensional cross 
section. (P is the true object causing the 
optical motion at the eye; Pa and Pe: are 
two confusable objects undergoing optically 
t displacements. Algebraic analysis 
he three-dimensional situation, 
nfusable objects is much 


equivalen 
is needed for t 
where the class of co 
more restricted.) 


ivalence classes, but on 
teristics, and thus 


No assumptions will be made about 
or the trajectory 
ements. Instead, 


| relationship be- 


considered. 


relationship b 
stages of an optical motion. 

Figure 1B shows the case of several 
different object displacements pro- 
ducing the same transformation of 
the optical array (the sheaf of light 


© Maps > Optica 
Ran) e —À pesi 


CORRIDOR L-———— a E 


bject ot > gpa 
RNS pr ci 


Diagram of the mathematical 
analysis. 


Fic. 2. 


rays entering the eye). The diagram 
is a two-dimensional CrOSS section 
of the real situation. P stands for the 
"true" object of Figure 1A, under- 
going the "true" displacement D. 
Pa and Pez stand for two confusable 
objects, in the sense that they can each 
undergo a displacement, Da or De, 
which causes the same Optical trans- 
formation as P and D. The situation 
is comparable to the familiar case of 
different stationary objects being 
optically equivalent with respect to 
the eye (e.g., Ittleson, 1960, p. 67). 
If the two-dimensional cross section 
of Figure 1B were an adequate repre- 
sentation of the three-dimensional 
situation, our question would already 
be answered: Many different object 
displacements are optically equival- 
lent, in cross section, to one and the 
same optical transformation, 

In the real three-dimensional situa- 
tion, the optical equivalence classes 
will be shown to be much more 
restricted than suggested by the 
cross section of Figure 1B. For this 
analysis, algebraic techniques are 
more convenient than geometrica] 
constructions. Matrix algebra at the 
level developed by Birkhoff and 
MacLane (1965) will be used. To 
facilitate understanding, a verba] 
Summary of the algebraic argument 
follows. s 

Figure 2 shows the analysis in ab- 
Stract form. To begin with, the O 
Mapping is defined, which maps ob- 
jects into optical arrays at the eye's 
station point. This O Mapping does 
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not preserve three-dimensional struc- 
ture, since the optical array (and the 
image that it defines on the retina) is 
essentially two-dimensional. . 

The central problem of the analysis 
concerns the O* Mapping in Figure 2. 
This is the mapping of object dis- 
placements, D, into optical trans- 
formations, T. First it is shown that 
the O* Mapping is determined jointly 
by D and by the initial position of the 
object; thus the diagram in l'igure 2 
shows two "inputs" for this mapping. 
Second, it is shown that if the object 
is a flat surface, the O* Mapping 
Causes a specific geometrical trans- 
formation of the optical array: a two- 
dimensional projective transformation. 
Projective transformations have defi- 
nite Properties of their own, inde- 
pendent of the Optical array to which 
they are applied, just as one may 
Specify an object displacement with- 
Out specifying the object. Finally, 
it is shown that for a multifaceted 
object, the O* Mapping causes dis- 
tinct projective transformations for 
each flat surface of the object, when 
the object as a whole is displaced. 
That is, the complicated optical 
transformation caused by displacing 
à solid object can be analyzed into a 
family of distinct projective trans- 
formations, PT, . . . PT,, one PT 
Corresponding to each face or coplanar 
Set of points in the object. This last 
finding, based on what will be termed 
the Parallax Theorem, suggests that 
further analysis concentrate on pro- 
jective transformations of the optical 
array. 

The question with which we began 
now becomes: Is there a narrowly 
limited equivalence class of surfaces 
and displacements for every projective 
optical transformation? The affirma- 
tive answer to this question is con- 
tained in the Distance and Orienta- 
tion Theorems. 


OPTICAL MOTIONS IN SPACE PERCEPTION 


The final problem dealt with con- 
cerns the properties of projective 
optical transformations and how they 
might be registered by the eye. For 
this problem, the optical array is 
converted into an image on a flat 
plane tangent to the retina at the 
fovea. In terms of this image plane, 
eight parameters of projective trans- 
formation are specified, and they are 
shown to bear a complicated rela- 
tionship to the parameters of object 
surface and displacement. It is 
further indicated how eye movements, 
by fixating or tracking certain parts 
of the transforming array, may sim- 
plify the image movement actually 
impinging on the retina and thereby 


facilitate its registration. 

Tur MAPPING OF OBJECT DISPLACE- 

MENTS INTO OPTICAL 
TRANSFORMATIONS 


The following symbolic conventions 


will be used: 

a, b, +> =scalars 

a, b, ++: =3-unit row mat- 
rices; for €x- 
ample, (a1 bi c1) 

4, B, os —3X3 square mat- 
rices 

a’, A’, etc. =transposed mat- 
rices 

AB, aB, ab’, etc. =matrix products, 
multiplying the 
rows of the left- 


hand matrix by 
the columns [o 
the right-hand 
matrix. 

— the set of all s 
multiples of a, 
A, etc. 


calar 


[na], [nA], etc 


The following correspondence - 
tween matrices and geometrical con- 
cepts will be employed: 
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a, 105 : vectors (or points in 3- 
space) relative to a set of coordinate 
axes. 

aa'=a;2-+ay?+-a;?: squared length of 
vector a= |a|? 

ab —ajbi--asbsd-asbs: inner product 
of vectors a and b= |a| |b|cos Zab. 

ap! —1:a plane P, whose inverse inter- 
cepts comprise $. 

aCC'a'—-0: a quadratic surface in 
space, orthogonally equivalent to 
the points satisfying the equation 
cai? +cat d- 385? — 0, where €i, C2, 
and cs are the characteristic roots 
of CC’. 


Figure 1B can now be interpreted 
in three-dimensional space by setting 
normal orthogonal base-vectors on 
the eye's nodal point: a lateral or X- 
axis vector perpendicular to the dia- 
a vertical or Y-axis vector 


gram; 
in the diagram; 


running vertically 
and a line-of-sight or Z-axis vector 
running horizontally. Each object 
point in space now corresponds to a 
vector € = (XeYeZe) 

The optical array will be repre- 


sented in terms of a single pencil 
of light rays converging at the eye's 
nodal point, as in Figure 1B. Since 
the nodal point is the origin of the 
vector space, each ray is a one- 
dimensional subspace ; that is, the 


set [ne] where n ranges over the 


scalars. - 

With these definitions, the O Map- 
ping in Figure 2 of object points into 
optical rays becomes: 


O(e) = [ne] [1] 


tes a singular perspective 
three-dimensional 2 
i e two-dimensional space 0 the 
d array (cf. Klein, 1908). It is 
clearly a many-to-one mapping: Any 
two object points €i and e; have the 

long as €» = nei 


same mapping $9 2 
for some scalar n. The mapping has 


This constitu 
mapping © 
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one important restriction: Object 
points behind the eye do not reflect 
light into it. Therefore there is the 
following qualification on Equation 1: 


If O(e) exists, then ze 20 


O is not, however, the needed 
mapping of object displacements into 
optical transformations. This map- 
ping is O* of Figure 2, and is derived 
indirectly as follows: 

The displacement D in Figure 2 is, 
by definition, a rigid motion, a change 
of position without change of size or 
shape. Every such transformation 
is a combination of a rotation and a 
translation, and can be represented as 
follows (Birkhoff & MacLane, 1965, 


p. 243): 
D(e) 2 eR +4 [2] 


where R is orthogonal, its elements 
being the direction cosines of the 
rotation, and / is a TOW matrix, its 
elements being the X, Y, and Z 
components of the translation, 

Having thus determined the O 
Mapping and the Displacement D in 
Figure 2, it is possible to derive the 
algebraic expression for the Trans- 
formation T of the optical array. 
The initial optical array produced 
by the object with points e is given 
by the O Mapping; that is, the rays 
[ne]. The transformed Optical 
Array 2 in Figure 2 is given by apply- 
ing D and O in sequence to the object 
points e: 


T([ne] = OD(e) 
= O(eR + t) = [ner + nt] [3] 


This equation, representing the gen- 
eral case of a three-dimensional dis- 
placement of a three-dimensional ob- 
ject, is difficult to analyze. Far from 
being a one-to-one transformation, E 
is “many-to-many”: T([ne] is not 
uniquely determined by [ne], nor 
vice versa. 
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If attention is limited to coplanar 
points of the object, however, the 
Optical transformation T can be 
analyzed into simpler components. 
It will be shown, first, that the 
optical transformations caused by 
displacing coplanar points are of a 
Special category, namely the group 
of projective transformations. It will 
then be shown that the general 
Optical transformation T can, at 
least in many cases, be expressed as à 
set of projective transformations, 
PT,.. EI 

All coplanar points satisfy an 
equation of the following form: 


[4] 


where the elements of P are the in- 
verse intercepts of the plane P on the 

» Y, and Z axes. This means that 
for the flat surface P the displacement 
(Equation 2) is equivalent to a linear 
transformation of space represented 


as follows: 
D(e) = eA [5] 


for all e such that ep’ = 1 and where 
A=R+ pt, (Note: p't is a 3 X 3 
matrix of rank 1.) Combining Equa- 
tions 2, 3, and 5 gives the new optica 
transformation PT, which is produced 
by displacing coplanar object points: 


PT[ne] = [neA] [6] 


Equation 6 Satisfies the definition 
of a projective transformation (Birk 
hoff & acLane, 1965, 277). 
Such transformations are uniquely 
determined by only four rays (8? 
long as no three of them lie in the 
Same plane through the origin) an 
their transforms, 

The mapping O* of object displace- 
ments into optical transformations 
can now be defined for coplanar ob- 
Ject points, in exact parallel to the 
mapping O of object points into the 
Optical array, For this purpose, ! 


ep’ = 1 
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Is necessary to note that two matrices 
oP ates Ay will determine the same 
tion Ba transformation in Equa- 
Seine d. and only if Ay = mA, for 
1965 — m (Birkhoff & MacLane, 
Yeh EU ! Thus a projective 
set of rmation is represented by a 
ph matrices [mA], just as an 
matr ray is represented by a set of 
tatie [ne]. In terms, of the 
bue ces which represent object dis- 

ment (R and /) and the object 


plane (p): 


O*(R,,p) = [mR + mp’t] 

[7] 

= [mA] 

Tees question to be raised in the 
One Section is whether O* is a one-to- 
to mapping, unlike O. It remains 
Whet] determined here, however; 
«Jer the foregoing formulation of 
UE terms of coplanar object points 
Optic. T rve to analyze the genera 
or al transformation of Equation 3. 
“sider the case of a multifacete 
Ject; that is, one with several plane 
Vill c Displacing such an object 
orma ause a set of projective E 
aii of the optical array. ` i 
be a distinct projective trans- 


Or; Y 

objecition for each distinct face of the 

supot? Furthermore, could nonplane 
o induce 


r 

og of the object als 

tica Ctive transformations of the «a 
ang array? These questions can k 
Wered as corollaries to what may 


e 
Called the Parallax Theorem- 


Pa 
rallax Theorem 


sani, wo object points map ! 
do g, ODtical ray, they wil 
Jec 9 after they have both b 
Specifi to a displacement, except 
"i ed circumstances. nave 
the °2/ If two points 4 and €2 
the; Same optical array mapping 
"quí ne for some scalar 
the ation 1). Now supP 
transforms after disp 


under 


ose 
lacement 
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were to still map into a common 
optical ray. This implies: 


eR 4- (9 m(eR +t) 
= m(nak + i) 


This vector equation can be rewritten 
as follows: 


(1 — mn)aR = (m — 1: [8] 


Equation 8 can be true only under 
two circumstances: (a) if eR lies in 
the same direction from the eye as /, 
or (b) if t is zero. 

Corollary i. When a nonplane set 
of object points is displaced, it does 
not cause a projective transformation 
of the optical array, except under 


specified circumstances. 
Assume a set of five co- 


Proof. u 
planar object points: êr- «es. When 
subjected to a displacement D, they 


induce a projective transformation 
PT on the optical array, where PT 
= [nR + np't] (Equation 7). Now 
assume that és is replaced by another 
object point és which has the same [9] 
mapping as €5 but does not lie in the 
plane of e1°* €» If es is to induce the 
same projective transformation as 
£08 then Des must have the same 
O mapping as e, But this con- 
the above theorem, with the 
for that theorem. 
ccurs for the 
hen there are only four 
Since any four rays 
i determine a 
and ther tene those rays 


ee i 
ective transforma 
ted not originate in coplanar object 
ints. AE 
When n dis 
ollary i 
pe subject to a ps d 
lis É ement, they induce n distinc 
pee formations of the op- 
wie ay cept under specified 
tical 8 ' 
circumstant. em ane 
Mit cannot, in genera» 
nonplantngle projective transforma- 


v 
( 


Two forms (a and b) undergoing 
projective transformation, PT). 

ating 
from 5? out of the frontal plane to 55° from 


frontal, around an axis tilted slightly clock- 
wise from vertical.) 


Fic. 3. 
the same 
(PT; corresponds to a flat surface rot 


tion of the optical array. 
tion 6 shows that every set 
object points induces a projective 
transformation of the array. Hence 
every distinct set of coplanar points 
induces a distinct Projective trans- 
formation of the array, T he excep- 
tions to this are the Same as to 
Corollary 1. 


But Equa- 
of coplanar 


Discussion 


The analysis has shown that, just 
as objects are mapped into optical 
arrays, displacements of objects (so 
long as the objects are flat) are 
mapped into projective transforma- 
tions of the optical array. Whatever 
the form or texture of the flat surface, 
so long as it undergoes a specific dis- 
placement from a specific position, 
it will produce the same projective 
transformation in the optical array. 
(See Figure 3.) en 

All the optical rays originating in 
the flat object surface will ‘‘move 
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together," not in the sense that their 
separations remain constant, but in 
the sense that if three rays lie in the 
same plane they will remain coplanar. 
That is, in terms of the retinal image 
formed by the rays, straight image 
lines will remain straight, while u^ 
lengths and orientations vary. E. 
the visual system is able to respon ^ 
selectively to optical motions ig 
preserve straight image lines, then 1 
will have the basic capacity for 
discriminating spatial displacements 
of flat surfaces. ] 
Nonflat object surfaces cannot E 
duce projective transformations Pa 
the array, except under very specia 
circumstances (Parallax heor i 
Corollary 2), If the object is mu ti 
faceted, each of its surfaces will aie 
a separate projective og mere 
of the array (Corollary 2). In oe 
fashion, the complicated optical trans- 
formation, T, caused by displace 
ments of three-dimensional objects 
can be analyzed into a family, 
PT1- --PTa, where n is the number © 
flat surfaces in the object. (See 
Figure 4.) Objects with curved et 
faces may also be so treated, in big 
of planes which intersect them 
many points, or in terms of tangen 


; 35 oi 
planes which approximate their SU 
faces. 


T(b) 


PT(b) 


Pr (b) 


b 
Fic. 4. Left and right halves of pe 
in Figure 3 undergoing two different PY The 
tive transformations, PT; and PT». n ob- 
effect corresponds to a 50° rotation of a 
ject surface with a 70° bend in it.) 
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Tur SprcirIcıry OF OPTICAL TRANS- 
FORMATIONS TO OBJECT 
DISPLACEMENTS 


The algebraic model provides a 


means for answering the question 
with which we began: How many dif- 
ferent object displacements are op- 
tically equivalent to one and the 
same transformation of the optical 


array? The question can now be put 


in terms of projective transformations, 
i presentation. In 


brief, the question now consists of 
asking in how many different ways a 
3 X 3 matrix A can be expressed as 
the sum of an orthogonal matrix R 
and a rank 1 matrix pt (Equation 
5). The result of the analysis will 
show that, in the general case, a 
given projective transformation can 
be caused by two different object 
displacements, each applied to a 
different initial surface orientation. 
Intuitively, the argument depends 
on the fact that object displacements 


preserve distance between object 
points; that this requirement leads to 
a quadratic locus; and that for a 

include à flat 


quadratic locus to 
surface means that it must be, 


general case, two planes. 
tion as put sche- 


in the 


Consider the ques Cua 
matically in p ^ y od 
tr ation of the optica 

ansformatt msn 


caused by the displacement 
surface P, how many other surfaces 
P, could be displaced in an opticaly 
equivalent fashion? Each oni e 
surface Po is specified by a co m 
matrix p'o just as P was ow e 
by the matrix ? (Equo a = 
displacement © P, is speci i Es 
orthogonal matrix Re an a srt 
matrix te (cf. Equation 2). e í 
placement of Pe is therefore equiva- 
lent to t formation 


he linear transi 5 
Ac = Re + Pile » 


(cf. Equation 
For notational convenience, we shall 
continue to use 


p, Rt and A without 
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subscripts as the corresponding values 
for the true surface P. 

In order for the confusable surface 
P. to cause the same optical trans- 
formation as the true surface P 
Equation 6 sets the following condi- 
tion: 


A.— mA or 


Ro+p'de=m(R+p'1) 
[9] 


There is a restriction on this equation, 

deriving from the restriction on the 

O mapping (Equation 1): The scalar 
multiple m must be positive, or else 
A. would transform object points 
to the region behind the eye. 

Two theorems will be proved about 
Equation 9, and taken together they 
provide the answer to the above 
question. 


Distance Theorem 


If the confusable surface P. is par- 
allel to the true surface P, then it will 
induce the same optical transforma- 
tion as P if and only if Re = R and 
te = ht, where h is the ratio between 
the distances of Pe and P from the eye. 

Proof. Let P be determined, as 
above, by eb’ = 1, and Pe by ep's = 1. 
If the planes of P and P, are parallel, 
and the points in Pe h times further 
from the eye then p. = (1/h)p- 
Consider now the separations be- 
tween object points either in P or 1n 
P.e: All such separations are vectors 
= e, €» Where & and 
i P, or both in Pe Due 

i ity of matrix mul- 


tiplication, dp’! = exp’ — eb from 
which we infer the following: 

dp' =9 [102] 
d's = 9 = d.(1/h)p" = dep! [10b] 


In short, d and d, both lie in the plane 
through, the origin designated by 

1—0. The proof now depends on 
the fact that when P or P, is dis- 
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placed, that is, moved without change 
in size or shape, the length of the 
separation vectors d in this plane 
must remain unchanged: 


(4A) (4A)! — (dA)(dA) 2-dd [11] 


From Equation 9 and the distribu- 
tivity of scalar multiplication jt 
follows that 4. = A. To go from 
this to Re = R we need to note that 
dA, — dR, and dA =dR (combine 
Equations 9 and 10). Therefore 
dR, = dR for all vectors d lying in 
the plane dp’ = 0. But the rotation 
of a plane determines a unique rota- 
tion of all space, since rotation pre- 
serves the length and sign of vector 
cross-products. Therefore Re: = R 
From this it follows that Pd. = p, 
and 4, = ht trivially. 

Corollary 1. Given any component 
R, b, or t, the other components of a 
matrix A are determined, at least 
to within a scalar multiple. 

Proof. If p is determined, the 
theorem proves the corollary. If R 
is determined, clearly ’t is deter- 
mined by subtraction from A. The 
columns of p't are scalar multiples 
of p’; the rows are scalar multiples 
of 4. If ¢ is determined, finally, the 
same algebraic argument as in the 
above theorem can be made; that is, 
for all 6’ such that ¢b’ = 0, Ab’ = Rb’, 
and A is determined as above, 


Interpretation of the Distance Theorem 


This theorem shows that the optical 
transformation is in no way specific 
to the particular distance of the sur- 
face P or to the length of the transla- 
tory component of D, but that the 
Optical transformation is specific to 
the ratio between those two quanti- 
ties. A family of flat surfaces dif- 
fering only in their distance from the 
eye have a common surface orienta- 
tion, and this property is specified by 
the column matrix p’ (or any scalar 
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multiple thereof) in the equation 
dp’=0. (Contrast this with the 
particular surface specified by p’ in 
Equation 4, ep’ = 1.) 

It remains to be shown whether a 
projective optical transformation is 
specific to a particular surface orien- 
tation. To investigate this, the 
following lemma will be needed : 

Lemma 1. If A — R4 pt for 
some matrices R, p’, and £, then only A 
in the set [mA] is equivalent to à 
displacement of an object plane. 

Proof. The proof follows from a 
closer examination of Equation 11, 
which can be rewritten as follows: 


d(m:44' — Dd 2 0 [12] 


where I is the identity matrix. For 
every value of m, m?A A’ is a pos 
tive definite matrix (Birkhoff & “re 
Lane, 1965, p. 256), and hence mAs 

has three positive characteristic roots, 
and (m*44' — 1) has three charac- 
teristic roots. Thus Equation 12 15 
satisfied by a quadratic surface 1n 
Space, and without loss of generalii 
may be written (relative to a suitable 
orthogonal basis) as follows: 


(mtu — 1)X? + (mv — 1)Y? 
+ (mw — 1)72 = 0 [6 


Where u, v, and w are the (positive) 
roots of AA’. But the vectors d must 
also satisfy Equation 10; that is, they 
lie in a plane, Equation 13 is satishiec 
by a plane only if at least one of ph 
coefficients is zero, and the other C° 
efficients (if more than one) are ° 
Opposite algebraic sign, Thus i a 
(or u, or w) is the middle-valued ro e 
of AA’, my = 1. This can be En 
for only one value of m, since Nail 
fixed, and m 20 (restriction, S 
Equation 9). By the hypothesis, n 
the lemma, a plane satisfies Equal! 4 
13 if m = 1; hence this is so ot! 
MS QED. 
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Orientation Theorem 


In the general case, a projective 
transformation of the optical array is 
specific to two alternative surface 
orientations, and two corresponding 
displacements. The exceptions to this 
general case will be specified. 

, Proof. A projective transforma- 
tion corresponds to a set of matrices 
[mA] (Equation 7). But by the 
foregoing lemma, only A, as defined 
in the lemma, corresponds to an 
object displacement. Hence the pres- 
ent theorem depends on showing that 
A can be decomposed in only two 
ways into the sum of an orthogonal 
matrix and a rank 1 matrix p’t. To 
show this, we note that Equation 
13 sets limits on p’. No plane satis- 
fies Equation 13 unless one of its 
Coefficients is zero. If v (or u, or w) 
is the middle-valued root of AA’, as 
in Lemma 1, the equation is sim- 
plified : 


(u— 1)x*-- (w—1)2-0 


The following cases arise: 
Case 1: The general case is when 
u>1 and w «1, or vice versa. In this 
case, Equation 14 is satisfied by two 
planes passing through the origin, one 
of which must be the true plane 
dp’ = 0, and the other is the con- 
[usable plane dp'e = 0. By Corollary 
1 of the Distance Theorem, Po speci- 
fies the optically equivalent displace- 
placement, De- The confusable sur- 
face pe bears the following relationship 
to p, as can be seen by writing 
Equation 12 in scalar form, letting 
dp’ = 0 and canceling that term: 
pe = |t + UR’ [15] 
where |t| is the length of t. 

Case 2: If u or W'S equal to one, 
then only one plane, dp’ = 0, will 
satisfy Equation 14, and there is no 
confusable surface orientation. From 
Equation 15 it may be deduced that 


[14] 
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the absence of a confusable surface 
orientation depends on the following 
special relationship between p and 
the displacement: 


m — |t]? 
2 


for some scalar m # 0. 

Case 3: If both u and w are equal 
to one, then all points in space will 
satisfy Equation 14, and the result- 
ing projective transformation will 
bear no unique relationship to an 
object displacement. From Equation 
15 it can be deduced that this de- 
pends on the following alternative 


conditions: 
p=0 or t=O or m-0 [17] 


p=iR — [16] 


where O is that vector with all 


elements zero. 
Corollary 1. Ifa multifaceted ob- 


ject is displaced, thereby inducing a 
set of simultaneous projective trans- 
formations of the optical array, that 
set of projective transformations will 
specify the object displacement, and 
the orientations of the object faces. 

Proof. Under the hypothesis of 
this corollary, each of the projective 
transformations will have A matrices 


of the form: 
An = R+ phat 


o of these matrices have a 


Any tw 
difference of the following form: 
A,—A2= (p^ — p's)t, etc. 


That is, two of the projective trans- 
formations specify t. By Corollary 
1 of the Distance 'Theorem, this 
specifies R, and also ?^; and p's, etc. 
Corollary 2. If a single set of 
coplanar object points is present, 
and is subjected to à sequence of 
displacements, aS in the usual case of 
a continuous object motion, there will 
be a confusable surface orientation 
only under specified circumstances. 
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Proof. Under the hypothesis of 
corollary, the optical array is su = 
jected to a sequence Ai, dsi ie a 
projective transformations. 1 = 
A» apply different Sienaan LO 
the same true surface P. For their 
confusable surfaces Per and Peo to be 
identical, the following is required by 
Equation 15: 


t| b + 2G, 
P = [klp + 25, [18] 


This can be rewritten as a condition 
on the rotation and translation com- 
ponents of Ao: 


the product 


Interpretation 


of the Orientation 
Theorem 


native 
and two cor- 


nts. Thus, in 
the general Case, there is one con- 
fusable surface orientation, B. 
A one-to-one co! 


rrespondence is 
two-to-one cor- 
he object under. 
includes two or 


respondence when t 
going displacement 
more sets of coplanar Points, or when 
a single set of coplanar Points under. 
goes a sequence of displacements, 


Discussion 


When a flat object surface Moves, 
it causes a continuous motion of the 
optical array. This optical motion 
may be termed a projective optica] 
motion, since it will be composed of a 
continuous sequence of projective 
transformations. Any one of the 
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transformations is sufficient to specify 
two alternative object surfaces, and 
two corresponding ys 
(Orientation Theorem), but not = 
solute distance (Distance ag 5 
Any two of the transformations a 
suffice, in the general case, to ped 
à unique surface orientation for Les 
initial position, and consequently is 
unique sequence of displacemen : 
(Corollary 2 to the Orientation Theor 
rem). Thus the projective iar on 
motion caused by the moving nar 
surface contains a great deal he 
redundant information concerning the 
object surface and its motion. Thie 
makes it likely that even a low a 
of precision in registering Mee 
transformations would make possib ei 
Over a sufficient exposure period, ie 
accurate identification of object sur 
face and motion. al 
In the case of an object with severe: 
flat surfaces, the family of pac 
transformations produced by any i 
Object displacement will suffice, se 
the general case, to specify that e 
placement uniquely, and consequent y 
the set of object surfaces (Corollary 
1 to the Orientation Theorem). x 
suming that the visual system cou z 
register concurrent projective tani 
formations, this means that mu 5 
faceted objects produce an eve" 
greater degree of redundant Put 
tion during Optical motions, so kp 
Perception of such objects should s 
More accurate than the perception 
single flat surfaces, d 
The case of objects with em 
surfaces is not so clear. Since bres 
isa continuous range of planes whic é 
intersect such surfaces, there mue n 
a continuous range of project! S 
transformations induced by such d 
aces. It seems likely that registr 
tion of such transformations woul la- 
lifficult, ang consequently a e 
üvely low accuracy would be Ps 
dicted for the perception, by mea 
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of optical motions, of curved surfaces 
and their movements. 


PARAMETERS OF THE PROJECTIVE 
OPTICAL TRANSFORMATIONS 


_ What are the properties of pro- 
jective optical motions? That is, 
what would be involved in their 
registration by a visual system, and 
their "decoding" back into three- 
dimensional objects and displace- 
ments? 

_ The properties are most conven- 
iently specified not in terms of the 
optical array, but in terms of the 
image which the array produces on the 
retina. The image may be regarded, 
geometrically, as the intersection of 
the rays comprising the array with 
the two-dimensional surface of the 
retina. If only a small region (e.g 
the fovea) is considered, it may be 
approximated by a tangent plane. 
This image plane, containing image 
vectors i, is specified by to! = 1 lor 
some fixed matrix 0. If the fovea 
is taken to lie at unit distance from 
the nodal point of the eye then the 
“foveal image plane” zi 


is z= 
(When the eye turns, or when a dif- 
ferent region of the retina is COn- 
sidered, it is equivalent to rotating 
the image plane.) 

In terms of the 


Equation 6 can be 


foveal image plane, 
written out In 


scalar form as follows: 
_ ax iby =¢ 
PT (x) = gx thy -™ 
P dx + ey — Í  r20 
mied a ] 
PT(2-7?-7 -1 
The number of coefficients in these 
scalar equations js nine; however, 
independent 


only eight of them are 
since numerator and de- 


parameters, z 
nominator O both fractions can be 
multiplied by @ scalar without altering 


Fic. 5. g The four horizontal parameters 
of projective transformation in the image 
plane. (a is stretching, b is shearing, ¢ is 
translation, and g is foreshortening. The 
four vertical parameters can be seen by 
rotating the Figure by 90 degrees.) 


their function values. Hence it may 
be concluded that there are eight 
parameters of projective image trans- 
formation induced by displacing an 

his result may be 


object plane. jh 
compared with Gibson's identifica- 


tion of only six parameters of image 
transformation (1957, p. 290). The 
discrepancy arises because Gibson 
was only looking for correspondents 
of six parameters of displacement, 
and neglecting the two parameters of 
surface orientation at the start of the 
displacement. 
The eight p 
into four types, 
lateral (X-axis) 
dimensions of the image. 
illustrated in Figure 5. 


Image translation. (c and f 
ters; Figure 5c). 

Image stretching (a and e param- 
eters; Figure 5a)- 

Image shearing (b and d param- 


eters; Figure 5b). 

Image foreshortening (g and h 
parameters; Figure 5g). 
]mage stretching bears special com- 
ment. Equal vertical and lateral 
stretching, that is, magnification, is 
produced by the parameter m in 


arameters can be sorted 
each applying to both 
and vertical (Y-axis) 
They are 


parame- 
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Equation 20. Separate lateral and 


vertical stretchings are produced by 


the parameters a and e oe 
However, it is clear that these three 
coefficients only result in two inde- 
pendent image changes, which we 
will describe as lateral and vertical 
stretching. It is also noteworthy 
that the transformations of lateral 
when equal, 
and supplemented by suitable stretch- 
of the image, 
Parameters of 


vertical components 
and shearing, 


A visual system 
ing these four 


; 1 the Parameters of 
object displacement, 


as proposed in 
(1957) analysis, | 
readily be 


scalar form, 


transformation param 
to those 
writing 


et 
of the object, 


Vertical Rotati 
Lateral Image Stretching - 

Line-of-sight or 

Vertical Rotation 
Lateral Image Shearing = 


Line-of-sight Rotation 


Lateral Image Foreshorter lung 


Vertical Rotation Component + 


on Component 
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elementary, algebra, a maate 
account of the relationship will ri 
given here. For "e 
following parameters will be use 


: its displace- 
describe the object and its displa 
ment: 


Initial surface orientation: E 
ured relative to the frontal e : 
Lateral slant is measured by the s M 
of the surface's intersection pln 
XZ plane; vertical slant by the s the 
of the surface's intersection with 
YZ plane, d 

Initial surface distance: ig 
by the intercept of the surface 
the line-of-sight (Z-axis). " lation 

Rotation axis: measured in re “The 
to the three coordinate axe the 
vertical rotation component 18 axis 
the projection of the rotation an 
on the vertical coordinate axis, nal 
similarly for the /ateral and ed o 
Sight rotation axes. (The. — be 
magnitude of the rotation will itative 
taken account of in this qualita 
€scription, its 

Treni axis: measured ee 
ateral, vertical, and sag agen 
Ponents. (It can easily be "d an 
that any object rotation — the 
axis which does not pass throug 
€ye involves 4 translation.) 


avametérSi 

In terms of these object cie 
Equation 6 can be written ou 
ollowing way: 


4 Lateral Translation 
or RAO, 
Surface Distance 


5 nt) 
Component 4. (Lat. Trans.) (Lat. Sla 


Surface Distance 


"T. rt, Slant) 
Component + (Lat. Trans.) (Vert. Slant) 


Surface Distance 
rt. slant) 


(L-of-s Trans.) (Ve 
Mof-s Trans.) (Vert. Slat 


Surface Distance 
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er a a z " 
or vertical image transformations, interchange the terms “vertical” and 


u ; 
lateral" wherever they appear in the above equations. 


For simplicity, the 


case of equal lateral and vertical stretching, or magnification, can be listed 


separately : 


Image Magnification — 


Lateral or Vertical Rotation Component 4- 


From these equations it is clear 
that there is far from a direct one-to- 
one correspondence between the 
parameters of image transformation 
and those of object displacement. 
Indeed, the relationship may be 
summarized by saying that each type 
of image transformation is influenced 
by all four types of object parameter, 
and by all three dimensions of space. 
It follows from this that detection of 
the parameters of an image trans- 
formation would need to be followed 
by an interaction. between them, in 
order for the object characteristic 


to be identified. 


Effects of Eye Movement 

Since the eyes are free to move, it is 
possible that by suitable rotations the 
parameters of an image transforma- 
tion may be simplified on the retina, 
either in the sense that some of them 
are made zero, or in the sense that 
they are made to bear a more direct 
relationship to the object motion 
than is true of the general case il- 
lustrated above. For example, reflex 
pursuit movements of the eye will 
tend to hold foveal image points 
stationary on the retina, and this is 
equivalent to zeroing the translation 
parameters of the optical motion. 
A detailed analysis of the possibilities 
will not be given here, but two will be 


sketched : : p 
Choice of fixation € E itae 

a J ? a suitable 

is t o as to adopt a $ 

S turned s he axes of the 


ot 7 t 
posture relative to. 
object and its displacement, the 


parameters of the image transforma- 


Line-of-sight Translation 
Surface Distance 


tion may be simplified. For example, 
if there is no object rotation, and the 
eye is turned so that the line-of-sight 
lies along the translation axis, stretch- 
ing and foreshortening will be the 
only image transformations. This 
is essentially the case considered by 
Gibson, Olum, and Rosenblatt (1955). 
As a further example, if the rotation 
axis is fixed in space, and the eye 
turned so that (a) the vertical axis 
is parallel to the rotation axis and (b) 
the line-of-sight axis is pointed in the 
direction of the rotation axis, there 
will be no vertical shear or translation. 
Algebraically, such changes of fixation 
can be represented as orthogonal 
changes of the coordinate system, 
changing the transformation matrix 
A into the orthogonally equivalent 
matrix QAQ', where Q is the matrix 
of the eye rotation (Birkhoff & 
MacLane, 1965, p. 245). 

Pursuit eye movements. Further 
simplifications of the parameters of 
image transformation may be pro- 
duced if the eye tracks the movements 
of a certain image point, thus stabil- 
izing it on the retina. Algebraically, 
pursuit movements change the trans- 
formation matrix A into the row- 
equivalent matrix QA, where Q is the 
matrix of the eye rotation (Birkhoff 
& MacLane, 1965, p. 245). 

These observations on the effects 


of eye movement suggest that ex- 
ploring the optical array with the 
eyes, and determining „when the 
foveal transformation achieves a cer- 
tain simplification, may serve as a 
visual-motor means for identifying 


the important parameters of the 
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object displacement (cf. Gibson, 


1959). 


GENERAL Discussion 


This study has followed the pro- 
gram suggested by Gibson (1959) of 
looking for high-order variables of 
optical stimulation which are in close 
correspondence to the environment, 
and may therefore Serve as sources 
of information to the visual system, 
It was shown that, as Gibson (1957) 
hypothesized, optical transformations 
are in close correspondence to object 
displacements, and also—as Gibson 
did not make clear—to their Starting 
positions. 


the Present 
First, only 
' Not changes of 
considered, If 


reversed, 
(cf. Fieandt & 

Gibson, 1959; Poincare, 

limitation on the 

study was that neither the ¢ 

nor the speed of object dis 


he present findings may be com- 
pared with those of other studies of 
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the moving shadow illusion mee 
Stein, 1962). Mach (1914) suggeste 
that when an image is clongated E: 
shortened along one dimension — i 
induces the impression of ad 
in depth. Thus Mach identified g 
parameter of projective ian ora 
tion, stretching, as critical. hop. 
(1953) presented evidence which is s 
conflict with Mach's theory, an 
found that the critical stimulus xd 
the impression of rotation in dep h 
is the simultaneous change of length 
and angle in lines of the image; 
that is, shearing or ey nete 
Johansson (1964) found evidence bees 
Simultaneous change in both -[ 
and vertica] image dimensions was i 
effective stimulus for motion in dept 3 
"he present analysis shows that any 
one of these hypothetical meghatni 
would fail to accurately identi y 
object characteristics except in y 
Special conditions (p. 563). If Ee 
visual response is to be specific to pn 
object characteristics, it must a 
Soverned jointly by all the naama 
9! projective image transformation: 


: à ; n 
translation, Stretching, shearing, à 
foreshortening, 
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A FREQUENCY THEORY OF VERBAL- 
DISCRIMINATION LEARNING! 


BRUCE R. EKSTRAND, WILLIAM P. WALLACE, Ax» BENTON J. UNDERWOOD 


Northwestern University 


The frequency theory of verbal-discrimin 
lates that the cue for discrimination is t 
occurrence between the correct (C) and 
a VD pair. It is postulated that, as VD 
2:1 frequency difference in favor of the 
use this difference as their cue. 


ation (VD) learning postu- 
he difference in frequency of 
incorrect (I) alternatives of 
learning proceeds, at least a 
C item is built up, and Ss 
ays selecting the item with 
rence, Ss can perform effi- 
which produced results in 


The present paper deals with the 
processes involved in verbal-discrimi- 
nation (VD) learning. In VD learn- 
ing, pairs of verbal units are presented 
to the subject (5), and his task is to 
discover which unit in each pair has 
been arbitrarily designated by the ex- 
perimenter (E) as being the correct 
unit. The correct alternative shall be 
referred to as the C unit and the incor- 
rect alternative as the I unit. Nor- 
mally a list consisting of several VD 
pairs is presented on a memory drum, 
and S must pronounce the correct re- 
sponse within a prescribed time limit. 
In a typical VD list, the C and I items 
are presented to S one above the other. 
(Half the time the C unit is presented 
above the I unit, and half the time the 
two are reversed.) The S is requested 
to choose one of the two units and pro- 
nounce it, after which E informs S as 
to the correctness of his choice. This 
information has been given verbally— 
E says “right” or "wrong"—or visu. 
ally—the memory drum is programmed 

1 This work was supported in part by Con- 
tract Nonr-1228 (15), Project NR 154-057, 

etween Northwestern University and the 
Office of Naval Research. Reproduction in 


whole or in part is permitted for any purpose 
by the United States Government, 
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to indicate the C item after S’s choice 
is made. Tt has also been typical ial 
VD procedures to omit a study tria 
at the beginning of learning ; since "es 
are only two alternatives (althoug? 
more alternatives could be employed); 
S has a 50:50 chance of heing correc 
without any learning taking place- $ 

The theory to be presented here Ben 
first developed to explain the rest! i 
of an experiment on transfer from on 
VD list to another VD list, but it be 
came apparent that it could be used i 
interpret the manner in which a sing 
VD list is learned. Tt will be useful 
review first the results of that trans im 
experiment (Underwood, Jesse, & E h 
strand, 1964) and the manner in ui 
the results were interpreted. The ra 
tension of the interpretation to er 
list VD learning will then be C 
sidered. 


TRANSFER EXPERIMENT 


The Underwood et al. ( 1964) eet 
periment employed three condito, 
The control condition involved . ts- 
learning of two unrelated VD list? 


Ss 
In the wrong condition (W): an 
learned one VD list to criterion jist 


then transferred to a second V. 


VERBAL DISCRIMINATION IN LEARNING 


mr the I item in each pair in 
iw a — the I item in List 2. 
with items of List 1 were replaced 
R (ri a C items in List 2. Condition 
V; i w) hei the reverse of Condition 
same de items of List 2 were the 
all of items that were in List 1, but 
List 1 the T items were changed from 
to List 2. 

T> results indicated that Condition 
ra essentially 100% transfer. 
in te er was initially almost aS high 
a e on W, but these Ss showed 
bens inability to improve their per- 
toeni across trials. Indeed, the 

rol group managed to reach cri- 


teri 
"M on List 2 before the W group. 
nderwood et al. offered the follow- 
r results. 


Ti interpretation for their T 
1e basis for the discrimination in List 
ees the subjective difference in fre- 
and T: of occurrence between Ls 
tion nM within a patr. In Condi; 
item S al of the I items were nem 
Wer rie List 2, and all of the C iten 
M e “old”—they had been in 
co always selecting the old items 5S 
"ld perform perfectly, which they €S 


Sent t 
ect did. Always selecting s 
“ie hai, vo units in 2 pai 

eo e thich shall 


te “rule” for VD learning Wape term 
wi. ud to as Rule £ The 
mm be used for € 
timere d this usage S$ 
s trued as necessarily me 
ett ely applies a Tule i ao 
it sense of the word. +” the pe 
al ion, all of the I items Wor diit 
E the C items were new. d be al 
Way. the appropriate rule would © the 
en to guess the less frequent ince 
a units in a pair ( ) 
is ee fully inform d ^ 
Since tween List 1 and "^ 
it sh at the start of learn"? 
Gere be very easy tO te 
in | old, initial transfer pro 
oth groups, and indee i 
t can be seen, howeve^ 


Ss 
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continued on List 2 the new items 
would start to get old, and the old 
items would get older. Underwood et 
al. reasoned that C items get older 
faster than I items because the C items 
ed once in the act of an- 
ticipation as well as seen once on the 
memory drum. The I items, on the 
other hand, are only seen on the drum 
and are not pronounced. Thus, each 
trial adds two frequency “units” to the 
C items and one unit to the I items. 


This is the basic postulate of the pres- 


ent theory. 
Assuming this 2:1 frequency differ- 


ence in favor of the C items, the failure 
of the W groups to improve with prac- 
tice is understandable. The new C 
items get two units on each trial 
whereas the old I items are getting 
older, but only at the rate of one unit 
on each trial. At some later point in 


learning, the new C items will be as 
¥ nd beyond that 


ld as the I 
point they will be older than the 
items. 


are pronounc 


licatio 
this point on will result in a 

; e because Rule 1 is now appro- 
For any given VD pair, Rule 1 
be the appropriate one, 
f the C and 


r than their corre- 


J items; therefore, the per- 


should suffer. 


to extend 
t al 


derwood E ” to the single-list VD 


hat in learn- 


transfer ' assumed t 

situn list, the cue for dis- 
ing @ sing e he subjective difference 
criminat ; o ccurrence between the 
e fre in each air. More 
C an ^a it is asse" ed that as trials 

ation re is puilt uP at least a ^* 

proce 


Du 
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frequency difference in favor of the C 
item, and that this difference can serve 
as the discriminative cue so that appli- 
cation of Rule 1 will result in the cor- 
rect response. 

Let us trace the sequence of events 
in the learning of a VD list and indi- 
cate the critical events assumed by the 
theory. On the guessing trial, a pair 
of items is presented, and S reads each 
one. This act of perception of each 
alternative has been referred to as a 
representational response (RR) by 
Bousfield, Whitmarsh, and Danick 
(1958). The RR to each item, accord- 
ing to the present theory, adds one fre- 
quency unit to the C item and one to 
the I item. After the two RRs have 
been made, 5 chooses one of the two 
items and pronounces it; this js his 
guess as to the correct one. This pro- 
nunciation response shall be referred to 
as the PR. It is assumed that the PR 
adds another frequency unit to which- 
ever item is Pronounced. If S made 
his PR to the C item, there would be 
22:1 frequency difference in favor of 


the T item, 

Now, by chance, half the PRs would 
be T items and half would be C items, 
So that if no frequency units other 
than the RR and PR were involved, 
the frequency-difference cue would be 
useless. The more frequent item in 
each pair would be correct in half the 
pairs (those pairs for Which S guessed 
the C item) and incorrect in half the 
pairs (those for which S guessed the 
Titem). Rule 1 would be appropriate 
for the former type of pair, and Rule 2 
would be appropriate for the latter type 
of pair. In order to learn on this basis, 
S would have to learn Which pairs go 
with Rule 1 and which go with Rule 2. 
For this reason, it is believed that a 
third type of response is occurring, 
namely, a tehearsal-of-the-correct-al. 
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ternative response (RCR). This RCR 
may be thought of as a pronouncing 
of the C item implicitly or explicitly, 
and it may occur more than once im- 
mediately after E supplies the infor- 
mation about the correctness of S's 
response. 

The nature of the hypothesized proc- 
esses in VD learning can now be more 
explicitly stated. In general, it is as- 
serted that the single VD list of ums 
related items is learned by application 
of Rule 1; namely, select the more ff 
quent alternative. In order for this to 
be adaptive, Rule 1 must he appropriate 
for all the pairs; that is, the unit git 
greater frequency must be the correc 
one. Pairs for which S guesses the | 
item on the first trial will have a pc 
difference in favor of the C dita 
Rule 1 will he appropriate. Pairs 10 
which S guesses the T item, as sed 
Out, would have a 2:1 difference ! 
favor of the T item, so Rule 1 bir: 
be inappropriate. However, if S mak 
one RCR after having been we m 
that his guess was incorrect, the diffe 
ence in frequency would be eliminate 
Such pairs would now have 2:2 ud 
and on the next trial if S guessed an 
this time chose the C item, there es 
then be a 4:3 frequency ratio in eae 
of the C item. Now Rule 1 wodd 
appropriate for this item. Tf S ma he 
more than one RCR after choosing a 
I item, the pair would immediately 
come appropriate for Rule 1. sso 
plicit in this discussion is the gre 
that if the guessing trial were cli! A 
nated (thus eliminating some O nce 
PRs to the T items), performa 
would be enhanced. e 

The extent to which these RORE 
made by Ss is not known, but for e 
quency to be the appropriate cue =f 
for Rule 1 to he the appropriate 
for every pair, it is only necessary 
Postulate that one RCR is made the 
that it is made only when 5 guesses 
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A item. In other words, the frequency 
unit that is added by the RCR just 
offsets the frequency unit that the I 
item received as a result of the PR. 
This returns the pair to a “neutral” 
state (2:2) so that there is a 50:50 
chance that S will make his PR to the 
C item on the next trial. If Ss make 
RCRs more than once for any given 
pair, and if they make RCRs when they 
have chosen the C item, the frequency 
cue would only be further enhanced. 
For example, if S makes a correct 
choice on the guessing trial, it has been 
postulated that there is a 2:1 difference 
in favor of the C item. But if, in addi- 
tion, S makes one RCR, there would 
be a 3:1 difference. 

As trials continue, there will be a 
buildup in frequency units for the C 
and I items, but this buildup will al- 
ways tend to favor the C item. Asa 
result, the difference in frequency units 
between the C and I items will in- 
crease, making the discrimination be- 
tween the more and less frequent items 
easier. Note that the interpretation 


states that the number of frequency 
units for each item can be determined 
Rs, and RCRs 


by counting the RRs, B 
that have been made to that item. It 
also is apparent from the above dis- 
cussion that it has been assumed that 
one RR adds as much to the subjective 
frequency of an item as does one PR, 
or RCR; that is, the equality of RRs, 
PRs, and RCRs has been assumed. 
Both of these assumptions are merely 


working hypotheses. It is quite possi- 
TEE tulated responses 


ble that the three pos M sen 
are unequal, that an RCR is “wor : 
more" than an RR, or that a PR o 
the C item (a correct response) 1$ 
somehow qualitatively different than a 


PR of the I item (a? error) in terms 
of the consequences for frequency units. 
For simplicity’s sake it has been as- 
sumed that these responses are equal 
in terms of frequency units added. 


569 


A fourth way in which frequency 
units may be added to VD items is 
through an implicit associative response 
(IAR). Bousfield et al. (1958) hy- 
pothesized that the presentation of a 
verbal unit elicits IARs related to the 
given verbal unit. Thus, if the word 
pog is presented, an IAR of cat (or 
some other related item) is presumed 
to occur. In a VD list the frequency 
of a given unit may be increased if 
that unit has occurred elsewhere in 
the lit as an IAR. For example, if 
the word cat appears in the list, it will 
attain a certain frequency value after 
one presentation based on RRs, PRs, 
and RCRs. The appearance of the 
word poc in the same list may increase 
the frequency value of the word cat to 
the degree that DOG elicits cat as an 
IAR. To the extent that the associa- 


tions are bidirectional, the appearance 
uency units to 


of car may add freq 

dog. In other words, S may “confuse” 
an JAR with an RR, so that these 
IARs may add frequency units in the 
se that an RR does. This fre- 
on by an IAR mecha- 
d to occur only in spe- 
eg, VD lists contain- 
elated words). 


same sen 
quency alterati 
nism is assume 
cific instances ( 
ing associatively T 

Another major process postulated 
here is that 5s can and do discriminate 
items on the basis of their frequency 
of occurrence. That they can make 
accurate judgments of the frequency of 
an item in the language has been shown 
by Underwood (1966). The Under- 
wood et al. (1964) transfer study 
shows that when frequency differences 


are extremely large, performance is 
essentially perfect; 4 fact which 
strongly suggests that the cue for dis- 


crimination in that study was the fre- 
uency difference. It has also been 
stated that Ss learn single VD lists 
by application of Rule 1; however, 
there is no reason to believe that when 
the frequency difference is reversed 
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(i.e., the less frequent member of each 
pair is correct), Ss cannot Operate us- 
ing Rule 2. The W condition of the 
Underwood et al. (1964) study shows 
that they can. Thus, either Rule 1 or 
2 can be applied depending on the 
appropriateness of the Situation. [n 
the single-list VD situation Where S 
is required to pronounce the C item, 
there is a built-in difference in fre- 
quency, due to this PR, which makes 
Rule 1 more appropriate, 


Tue EXPERIMENT 


The frequency theory, first devel- 
oped to explain the transfer results in 
the Underwood et al, (1964) experi- 
ment, was admittedly postulated after 
the data were collected and analyzed. 
As such, the theory requires verifica- 
tion under other situations, and this 
was the purpose of the present experi- 
ment. More particularly, it was de- 
sired that the theory as applied to the 
single-list VD situation be tested. 

It was reasoned that if the discrimi- 
native cue is the relative frequency of 
the two units in a Pair, increasing the 
difference in frequency should facili- 
tate performance (a difference of 4:1 
should be easier than 2:1), and de- 
creasing the difference should inhibit 
performance (a “difference” of 2:2 
should be more difficult than 2:1), 
The method decided upon to produce 
these increases or decreases was to 
have an item occur in two pairs; there- 
fore, the repeated item would be dou- 
bled in frequency, receiving two RRs 
per trial instead of one. For the mo- 
ment, let us consider only the frequency 
units added by RRs and PRs, and let 
US assume that the frequency ratio is 
normally 2:1 when there are no re- 
Peated items in the list. 

In the first condition, the same item 
occurs in two pairs, and it is Correct 
in both pairs; this will be referred to 
as the Same-right condition (SR). 
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Since the repeated item is the C item 
in the two pairs in which it occurs, it 
will have four frequency units added 
to it on each trial (two RRs and two 
PRs, assuming S guesses the C item). 
The I item in each of those pairs will 
have only one unit added on each trial 
(an RR). Thus, for each pair there 
will be a 4:1 ratio in favor of the more 
frequent item, and application of Rule 
I will result in the correct response. 
If a 4:1 ratio makes for easier dis- 
crimination than a 2:1 ratio, Condition 
SR should surpass a control condition 
in which there are no repeated items 
in the list. 

In the second condition, the Serie 
item occurs in two pairs, but it 15 a 
correct in both pairs; this was calls 
the Same-wrong condition (SW ). 
Since the I item gets only one unit ve 
trial (an RR), the repeated item in this 
condition will now have only two units 
added to it on each trial (one RR ior 
each pair in which it occurs). bue 
item in each of these pairs will RR 
get two units added per trial (one 2: 
and one PR). Thus, the ratio is d- 
when only RRs and PRs are du 
ered. This should be considera! : 
more difficult than the SR sages 
Where the ratio is 4:1 and also det 
difficult than the control conditio 
(2:1). 

The third condition investigated ani 
designed to produce maximum apt 
ference by making the frequency, ion 
very difficult to use. In this pu 
the repeated item is the C item in B 
of the two pairs in which it ogen hall 
the I item in the other pair; this : " 
be referred to as the same-both i 
tion (SB). Consider the pair in ives 
the repeated item is correct; it rece 
two units for being correct (one the 
and one PR) and one unit from ai 
fact that it is repeated in another m 
(an RR). The I item in this pai The 
ceives only one unit, an RR. 
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resultant ratio is 3:1 with the more 
frequent one being correct. However, 
in the pair in which the repeated item 
is incorrect, the situation is reversed. 
he repeated item has a frequency of 
aree (an RR and a PR from the other 
pair, and an RR from this pair), and 
the C item in this pair has a frequency 
of two (one RR and one PR). The 
resultant ratio is 2:3 with the less fre- 
quent item being correct. As a result, 
Rule 1 would be appropriate for the 
pair in which the repeated item is cor- 
rect, but Rule 2 would apply to the 
pair in which the repeated item was 
incorrect. Suchan arrangement should 
produce considerable interference, so 
the SB group should be inferior to the 
other three conditions, SW, SR, and 
the control. In summary, the fre- 
quency theory predicts that the SR 
group should surpass the control con- 
dition which should surpass the SV 
condition. Finally, the SB group 


should do poorest. 
As a second met 
or decreasing the frequency ratios 
within a VD pair, strong associates of 
some of the items in other pairs were 
inserted. Thus, one pair might. be 
Queen-Fast and another pair, King- 
Pepper. Instead of repeating Queen to 
add to its frequency, We reasoned that 
presentation of King would lead to an 
IAR which would most likely be the 
word Queen. Of course, to the extent 
that the associations are bidirectional, 

presentation of Queen would also lea: 

to an IAR of King. Underwoo 
dence that 


(1965) has independent evi 
fuse the RR to an item 


hod of increasing 


Ss may con t 
with the same response given as an 
TAR to a strong associate of that item. 
i actually 


For example, if King were 
presented, but Queen was never pre- 


sented, Ss would indicate, nevertheless, 
that Queen (the IAR to King) had 
actually been presented. The present 
manipulations were designed to explore 


TABLE 1 
SAMPLE PAIRS 


Control {Same Sami 
Right (SR) Wrong (SW) 
Queen*Fast Queens-Fast Queen-Fast^ 


Doors-Pepper Queen*-Pepper | Queen-Pepper^ 


Associate 
Wrong (AW) 
Queen-Fast* 
King-Pepper^ 


Associate 
Right (AR) 

Queen*-Fast 
King-Pepper 


Same 

Both (SB) 
Queen*-Fast 
Queen-Pepper^ 


Associate 

Both (AB) 
Queen*-Fast 
King-Pepper* 


s Denotes the correct item in each pair. 


the possibility, mentioned earlier, that 
IARs may contribute frequency units 
in the same sense that RRs, PRs, and 
RCRs are hypothesized to increase 
frequency. 

Using strong associates, the three ex- 
perimental conditions described above 
were repeated. In Condition associate 
right (AR), the strong associates were 
the C items in their respective pairs; 
in Condition associate wrong (AW), 
the associated items were the I items 
in their respective pairs; in Condition 
associate both (AB), one item would 
be the C item in its pair, and the asso- 
ciate of that item would be the I item 
in its pair. To the extent that Ss make 
IARs that are words in other pairs, 
the predictions for these associated (A) 
conditions are the same as those for 
the conditions in which an item is ac- 
tually repeated (S conditions). The 
AR condition should produce facilita- 
tion, the AB condition should produce 
interference, and the AW condition 
should fall in between AR and AB, 
and below the control condition. Ex- 
amples of the seven types of pairs are 


presented in Table 1. 


Design 
vas a 3X2 factorial, with 2 


The design W à 
relationships (S or A) 


types of interpair 
of lists (R, W, or B). This 


and 3 types 


52 B. 


TABLE 2 
Lists USED 


Control list S list A list 


Cold-Sicknesss 
Cold-Woman 
Chair-Head 
Chair-Foots 
House-Roughs 
House-Bread 


Cold-Sicknesss 
Hot-Woman 
Chair-Head 
Table-Foot= 
House-Roughs 
Home-Bread 


Cold-Sickness 
Boy-Woman 
Chair-Head 
Carpet-Foot 
House-Rough 
Dark-Bread 


Tiger-Wool Tiger-Wool Tiger-Wool 
Nail-Short Tiger-Short^ Lion-Shorts 
Queen-Fast Queen-Fast Queen-Fast 


Queen-Peppers 
Needle-Sweeta 
Needle-Shallow 


Door-Pepper 
Needle-Sweet 
Ugly-Shallow 


King-Peppers 
Needle-Sweets 
Thread-Shallow 


Road-Web Road-Web Road-Web 
Round-Doctor | Road-Doctor* Street-Doctors 
Hill-Smoke Hill-Smoke Hill-Smoke 
White-Baby Hill-Babys Mountain-Babys 
Steal-Soft Steal-Soft» Steal-Softs 
Loud-Apple Steal-Apple Thief-Apple 
Town-Low Town-Low= 

Bed-Cat Town-Cat 


Note,—In the Control list, the pairs are given in the 
order: C item—I item; Cold was correct, Sickness was 
Incorrect. In the A and S lists, both of the associated 
Qr repeated items were correct in conditions AR and 


^ Indicat 
repeated item was incorrect in the AB or SB condi- 
tions, respectively, 


produces the 6 experimental groups described 
above: SR, SW, SB, AR, AW, and AB. 
In addition, a control group was run. The 


an interaction of type of List x type of 
Interpair Relationship, 
predicted because the effect of type of list 
(R, W, or B) should be greater when the 
same item occurs in two pairs (S conditions) 


unit, but presentation of a Strong associate 
will add a frequency unit only if § makes 


Lists 


Each of the seven lists contained 20 VD 
pairs. The items were selected from the 
Russell and Jenkins (1954) and the Palermo 
and Jenkins (1964) Word-association norms. 
From these norms 40 Pairs of associated 
items were selected; it was attempted in the 
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process to keep interpair associations at a 
minimum. By selecting at random one word 
from each pair, the 40 presumably unrelated 
words for the control list were obtained. 
These 40 words were paired randomly to 
make the 20 VD pairs, and within each pair 
one word was randomly designated as the C 
item for that pair; the remaining item, of 
course, became the I item. . 

In order to construct the experimental lists 
10 pairs were chosen at random from the 
20 control pairs, In addition, the 10 I items 
from the remaining 10 control pairs were 
used. This provided a pool of 10 pairs an 
10 single items. From cach of the 10 pairs 
chosen from the control list, two pairs for 
use in the experimental lists were con 
structed. For example, one control pair 
was Queen-Fast; in the S list this became 
two pairs, Queen-Fast and Queen-Pepper 
Pepper having been an I item in one of the 
10 control pairs not used. In the A list, the 
pair Queen-Fast also was used to make uP 
two pairs, Queen-Fast and King-Pepber: 
By using the 10 control pairs and the 1 i 
items selected from the control list, 20 a 
perimental pairs were constructed with maxi- 
mum overlap between S, A, and control rue 
The pairings for the S, A, and control lists 
are presented in Table 2. sht 

Once the pairs were determined, the right 
and wrong members were designated as fo! 
lows: In the AR list, the two associate? 
items (e.g., Queen and King) were both dE 
rect in their respective pairs; in the AW list 
the pairs were simply "turned over" so bis 
the two associated items were wrong in En 
respective pairs (ie, Fast and Pepper pe 
correct). In the AB list, one of the tW 
related pairs (chosen at random) was pe 
over so that the I item in one pair was ir 
Strong associate of a C item in another pev 
(e.g., Queen-Fast, Queen is correct; Kng 
Pepper, Pepper is correct). For the S lis : 
the procedure was identical; in SR, '7 
repeated item was correct in both pair$ as 
which it Occurred; in SW, the list M 
turned over so that the repeated item M: 
incorrect in both pairs; in SB, the bons 
item was correct in one pair but incorr 
in the other pair in which it appeared. 


Procedure 
in 
The procedure employed group learn 
Sessions with the materials being prese d 
via a tape recorder. The learning ms a 
was alternate study and test trials W! tes 
modified anticipation procedure on the ere 
trials. Four study and four test trials W 
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administered to each of the seven groups. 
No guessing trial was used. The presenta- 
tion rate was 3 seconds for both study and 
test trials. 

On study trials, the two items in each pair 
were read, and then the voice on the tape 
repeated the correct one; for example, 

Oucen-Fast, Queen is correct.” Both the 
reading of the pair and the saying of the 
C item took place within the 3-second inter- 
val. This procedure insured that a frequency 


difference in favor of the correct alternative 


existed, because the C item was said twice 
ials, the pairs 


on the study trials. On test tri 

were again read at a 3-second rate, but the 
voice did not indicate correctness. The Ss 
recorded their responses on IBM answer 
sheets numbered from 1 to 80, with two 
alternatives. for each of the 80 numbers. 
As each pair was read, Ss indicated in the 
corresponding place on their answer sheets 
which member of the pair (first or second) 
was correct. Since the nature of this proce- 
dure does not allow S to make a PR, the 
frequency unit for the PR is eliminated. 
Instead, the voice on the tape spoke the G 
item twice on study trials to insure the 2:1 
ratio postulated by the theory. Across the 
four study and four test trials, the C item 
of each pair was read first four times and 
second four times. Within a trial, the order- 
ing of the C and I items was random. Eight 
random orders of the 20 pairs were used for 


the eight presentations. 


Subjects and Instructions 
The Ss were college students from intro- 
ductory psychology courses at Northwestern 
University; there were 20 in each of the 
seven groups. The seven conditions were 
1 of 14 group ses- 


each run twice, so a total E 
sions was administered. These 14 sessions 


were run at either 1:00 p.M. or 1:30 P.M. on 
7 consecutive school days. The Ss made an 


appointment for one of the 14 sessions at 
seven conditions 


their convenience. The n 
were assigned at random to the 14 different 
sessions, subject to the restriction that each 
condition occur once before any condition 


received its second session. he result was 
0, 21, or 22 Ss in each of 


that there were 20, 
the seven groups. IBM sheets were ran- 
domly discarded until each group consisted 
of 20 Ss. 

The Ss were re 


instructions 
y modifi- 
No men- 

repeated 


ad ordinary 


for VD learning with the necessar 


cations for the group procedure. i 
tion was made of the possibility o 


or associated items. 


RESULTS 


The results (mean number of total 
errors) are shown in Figure 1—acqui- 
sition curves revealed no interactions 
with trials. The base line in Figure 1 
represents an assumed continuum of 
similarity of implicit responses rang- 
ing from unrelated items (control) to 
associated items (A) to identical or 
same items (S). The plotting parame- 
ter is the type of list, either R, W, or 
B. In addition, for convenience of 
comparison, a dotted line representing 
the control mean is included. 

An initial analysis of variance was 
performed on all seven groups. This 
produced an F (6, 133) of 25.68 which 
is highly significant. This indicates that 
the different treatments did produce 
differences in learning. Nexta 3X 2 
analysis of variance was performed on 
the six experimental groups. The re- 
sults indicated that both main effects 
and the interaction were 
better than the .01 level. 
effect of A versus S interpa 


significant at 
The main 
ir relation- 


(WRONG) 


CONTROL 


R RIGHT) 


MEAN NUMBER OF TOTAL ERRORS 


(CONTROL) 


INTERPAIR RELATIONSHIP 
Fic. 1. Mean total errors in four ec of 
carni ix experimental groups 
earning for the six expe i 
va Fee control group. (The dotted line rep- 
resents the control mean and is presented 


for the convenience of comparison.) 
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TABLE 3 
MEAN TOTAL ERRORS FOR THE SEVEN Groups 
AND RESULTS OF THE DUNCAN MULTIPLE- 
RANGE TEST 


AW Control AR SR 
2.80 


SB sw AB 
21.50 11.35 11.30 8.00 6.90 3.05 


Note.—Any two means underlined by the same line 
are not significantly different (Edwards, 1960). 


ship produced an F (1, 133) of 18.48, 
that of type of list (R, W, or B) an F 
(2, 133) of 52.78. The interaction 
gave an F (2, 133) of 8.83. It can be 
seen from the df that these Fs are based 
on the within-groups term for all seven 
groups combined. 

The results indicate that the A lists 
were easier than the S lists. Second, 
the type of list (R, W, or B) is impor- 
tant: The R conditions are clearly su- 
perior to the W conditions which, in 
turn, are superior to the B conditions. 
The interaction indicates that, as pre- 
dicted, these differences between R, W, 
and B lists are more pronounced with 
the S interpair relationship than with 
the A relationship. 

The control-group mean falls ex- 
actly as predicted, between the R and 
W conditions. Thus, numerically, the 
W and B conditions show interference, 
and the R conditions show facilitation. 
To evaluate the differences among the 
seven groups, a Duncan multiple-range 
test was performed using the .05 level 
of significance (Edwards, 1960). The 
Tesults are shown in Table 3 where 
the seven means are given. 

Within the S conditions, SR is sig- 
nificantly superior to the control condi- 
tion whereas SW and SB are sig- 
nificantly inferior to the control. These 
four groups are thus ordered and differ 
significantly as predicted by the theory, 
Furthermore, since the SW group dif- 
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fers from the control group, inter- 
ference in this group has apparently 
been produced by repeating an I item 
in another pair. 

Within the A conditions, the results 
are not as conclusive as for the S con- 
ditions. The four groups (including 
the control) are ordered in the same 
way as in the S conditions, but, as 
expected, the differences among them 
are considerably smaller. The AR 
group is significantly superior to the 
control group, and AB is significantly 
inferior to the control, indicating facili- 
tation and interference, respectively. 
The AW group, however, is only 
slightly and insignificantly inferior ie 
the control group, which may indicate 
that there is no interference from a 
strong associate being in the list when 
both associated items are I items. 


Discusston 


The results of this experiment pro" 
vide substantial support for the fre- 
quency theory. Both main effects an 
the interaction in the 3 x 2 design were 
Significant at the .01 level in the pte 
dicted direction, Within the A condi- 
tions and within the S conditions, the 
Sroups were ordered exactly as pr 
dicted. By increasing the difference j 
frequency between the C and I ud 
(R conditions), performance was im 
proved, and by decreasing this differ- 
ence (W conditions), interference, a 
though small in amount, was apparen 
Much more striking inhibition was 2p 
parent in the B conditions which pios 
designed to produce maximum di 
culty for frequency discrimination. — 

The results would also tend to ee 
Port the idea that these frequencl™ 
can be manipulated by presenting d 
ciates of some items in other Pai 
However, due to the small effects is 
this Operation, a strong conclusion : 
not indicated. Nevertheless, it ape 
that under optimal conditions, the P 
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Similarity 
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tend to increase the frequency of Queen, 
and vice versa, just as if King and 
Queen were in two different pairs. 
Thus, the Underwood-Viterna condi- 
tions were quite similar to the AB con- 


dition which showed evidence of inter- 


ference. 


Meaningjulness 


The theory would seem to require 
nat, with integrated 


the prediction tl 
units, meaningfulness of the units (in 
terms of frequency) should have no ef- 
fect. If the two units in a VD pair 
are of equal frequency, whether that 
frequency is high or low on an abso- 
lute scale, adding an additional fre- 
quency unit to either item should make 
the pair more discriminable. The the- 
ory might even predict (according to 
Weber's law) that adding one unit to 
an item already high in frequency 
should be less noticeable than adding 
one frequency unit to an item low in 
frequency: This would mean that 
irs of very-high-frequency units 
ore difficult than pairs of 
ncy units. Assuming that 
plished outside the 
ble to frequency 
units esta RRs, PRs, and 
:n the laboratory, one might then 
f high-frequency 


dict that VD pairs o 
D e more difficult than 


frequency words. How- 
icability of Weber's law 
ituation has not been demon- 
d neither has the equality 
d extraexperimental 


difference in trials 
1969) b Detween a VD list of non- 
sand a list of words. 

at VD pairs 
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low- or medium-frequency words. It 
should be pointed out, however, that 
the difficulty of the high-frequency list 
may be a result of a greater number 
of interitem associations in that list 
(Deese, 1960). To the extent that 
there are interitem associations in 
Postman’s high-frequency list, it is 
similar to the AB list of the present 
experiment (an interference condition) 
which may explain why it was more 
difficult than the medium- and low- 
frequency lists. 

If a detrimental effect of low mean- 
ingfulness is found for VD learning, 
the theory would predict that this 
would be with nonintegrated units in 
which the RRs, PRs, and RCRs are 
letter responses, instead of responses 
to the entire unit. To the extent that 
there are repeated letters in the list, 
the frequency theory would predict in- 
terference, since the RR to a noninte- 
grated unit could easily be confused 
with the RR to another unit with simi- 
lar letters (e.g., Underwood & Archer, 
1955). Moreover, the lower the inte- 
gration, the less reliable the RR is 
likely to be; that is, the less similar two 
RRs to the same unit will be. Until 
the unit is integrated sufficiently for 
RRs to be reliable, we cannot expect 
a buildup of a particular PR that can 
be used as a frequency cue, 

The theory need not necessarily pre- 
dict a difference in VD learning as a 
function of meaningfulness or frequency 
of occurrence of words in the language 
since extraexperimental frequency may 
not be comparable to experimentally 
induced frequency. However, if fre- 
quency differences are induced entirely 
within the experiment (possibly by fa- 
miliarization training), it is possible 
that a VD pair of two high-frequency 
units will be more difficult than a VD 
pair of two low-frequency units. Other 
things being equal (interitem associa- 
tions in particular), an extra frequency 
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unit added to a member of a high- 
frequency pair should not be "worth 
as much" in terms of cue value as à 
frequency unit added to a member of 
a low-frequency pair. 


Presentation Rate 


The theory makes a rather unusual 
prediction with regard to the effects of 
anticipation time. A large increase in 
anticipation time, with study time held 
constant, should have no effect on VD 
performance, or perhaps even an 1n- 
hibitory effect. If S is forced to spend 
extra time studying the two units 10 
a VD pair before he is allowed to make 
his choice, performance could suffer. 
This additional time should result i" 
additional RRs being made to the © 
and T items. If the same number © 
RRs is added to the C and I items by 
this extra time, the relative frequency 
difference can only decrease, making 
the discrimination more difficult (again 
assuming the applicability of Wee 
law). An analogy can be made € 
the strategy often suggested for taking 
multiple-choice examinations; namely: 
“stick with your first choice.” Con- 
tinued study of the alternative answers 
should tend to reduce their discrimina 
bility. On the other hand, if the study 
time after reinforcement is increase 
the theory predicts better performanen 
on the assumption that this time will oe 
Spent mainly in the act of makin’ 
RCRs. Increasing the study time after 
reinforcement should increase the ntum 
ber of RCRs relative to the number z 
RRs to the I item; any such not 
should result in a more discriminab 
frequency difference. 


Forgetting 
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If one subscribes to a strict intet” 
ference theory of forgetting an 
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Process, ^ 
Nu the standpoint O 
Sto uer interpretation, the theory 
lie predict that increasing the num- 
effe of pairs in a VD list will have no 
tn i. on the number of trials require 
Meo or that list. This prediction, 
Owever, will hold only if it is possible 


to inc : T ; 
© increase list length without also in- 
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Creasi h 
c Eg the number of interitem a550- 
ations, Such an increase in interitem 
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losoclations would ten! 
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framework 
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800d accounting of the availa a 
t the same time, 
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ere is a critical finding Y nearest 
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al approach. This finding S 
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in erence in learning -— en 
E ron repeated items " 5 were 
and a list in which repeat f the latter 
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list resulted in some interference (when 
compared with a control) only adds to 
the problem with which other theories 
must cope. 

The assumption that differences in 
frequency of events provide a discrimi- 
native cue should have applicability to 
phenomena beyond those associated 
with VD learning. Some possibilities 
for such extensions of the theory may 
be mentioned briefly. It has often been 
noted that in tasks where verbal units 
must be recalled (paired associates, 
free learning) it is a rare occurrence 
for S to produce a unit that is not in 
thelist. Even if the response units are 
10 single letters drawn randomly from 
the 26 letters, S will rarely produce 
one of the 16 letters not in the list. It 
is as if some selector mechanism (Un- 
derwood & Schulz, 1960) quickly and 
differentiates between the 


effectively 

appropriate and inappropriate pool of 

responses. A frequency discrimination 
e for this selective 


may be responsibl 
capacity. The letters in the list have 


a situational frequency of at least one 
each, those not in the list have none, 
thereby providing the discriminative 
cue. Even if S cannot recall response 
terms which are in a list, he is able to 
reject response terms which are not in 


ing the second 
4). Furthermo 
appear after several 


second list (Schwenn & 
. This might be 
that intrusions are 
when the associa- 
responses elicited 
s become equiva- 
y-discrimi- 
rou redict this phe- 
theory a rounds that after the 


e mean 
; to occur 
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second-list responses have occurred 
oeral times the frequency discrimina- 
Sevens reen first- and second-list re- 
— pep bats down as the frequency 
"io dhe second-list responses approxi- 
pains that of the first. : 
Intrusions at recall in retroactive- 
inhibition studies decrease sharply in 
frequency as the degree of interpolated 
learning increases beyond the level of 
original learning (Melton & Irwin, 
1940), although amount of retroactive 
inhibition does not. The reduction in 
intrusions would be expected from the 
frequency-discrimination theory since 
the higher degrees of interpolated 
learning make the frequency discrimi- 
nation between first- and second-list 
responses easier. 
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2 variables: 


discuss research tas 
normals, acute schizophrenics, 


results which have seemed contradictory can be clari 
(a) response disorganization characterizes both 


propositions that: 


acute and 
with response disorganization by o! 


Consider these statements, each of 
which has good research support: 
Overinclusive concepts have been 


la. 
hizophrenies (C hap- 


found in chronic sc 


man, 1961). y 
1b. Overinclusive thinking is found 


in many acute schizophrenics, but chronic 
schizophrenics are not abnormally over- 
inclusive (Payne, 962). . 

2a. Chronic schizophrenics attend to a 
narrower range of stimuli than normals 
( Venables, 1964). p 

2b. Chronic schizophrenics are more 
distractible than normals (Chapman, 


1956). 

When faced with such inconsistency We 
can either seek further data that will fa- 
vor some conclusions and oppose others, 
or we can search for clarifying distinc- 
tions, and then use these distinctions to 
guide later research. The reliability of 
the data concerning the four statements 
listed above is such that it seems worth- 
while to follow the latter course and think 
through the reasons for 


what may be only 
arent inconsistencies. Discussion will 


(a) degree of org: 
and (b) breadth of observation, are re 
ks on which di 
and chronic schizophrenics. 


chronic schizophrenics ; and 
bserving fewer stimuli. 


anization of competing responses; 


lated to each other and used to 


fferences have been found between 


r Research 
fied by the basic 


(b) chronics attempt to cope 


RESPONSE DISORGANIZATION 


Response disorganization refers to a 
lack of clear hierarchical ordering of com- 
peting response tendencies; that is, no 
single response tendency is highly domi- 
nant. The first principle to be used in 
the discussion is that many schizo- 
phrenics, both acutes and chronics, have 
abnormal response disorganization. More 
specifically, in a situation where a num- 
ber of alternate response tendencies are 
evoked, in normal persons these response 
tendencies are hierarchically ordered 
with appropriate responses clearly domi- 
nant, but for disorganized schizophrenics 
the probabilities of the alternate responses 


are more nearly equal. Interference from 
is therefore 1n- 


competing responses | l 
creased. The increase in intrusions of 
responses which normally have low prob- 
ability is seen as a major characters 
of schizophrenia. This principle, which 
is basically similar to Cameron s (1938) 
description of Sc i 


app 
be based on two processes: (a) xcu (1966) mime 
à S : ved ob- ie di h 
log a ant Gd Ex Bae pnormal response disor- 
FE ganization One way of estimating the 
i written while the author yi f this process comes from the 
eei in Lecturer at the University extent iG n by Broen and Storms (1966) 
Fa Pec Amherst. Robert Sue Maher (1966) that the widely noted 
x / tz gave : B 
LS E eon earlier F ge tendency of many schizophrenics toward 
helptu comme! 
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overinclusive thinking may be a conse- 
quence “of a more fundamental process 
by which all or many of the possible as- 
sociations to a word are at equal strength 
[p. 413]." If this suggestion bi Seats 
then the extent of overinclusive thinking 
may provide an estimate of the extent 
of the underlying process, which is here 
called response disorganization. As will 
be discussed later, some measures of 
overinclusive thinking may be influenced 
by more than one underlying process, 
especially in chronic schizophrenics. 
However Payne's (1966) statement that 
overinclusion ". . . is found in at most 
50 per cent of a typical sample of acutely 
ill mental patients who are diagnosed as 
schizophrenic [p. 95]" may provide an 
estimate of the extent of response disor- 
ganization in acute schizophrenics. AI. 
though lack of complete uniformity must 
be kept in mind, research done on the 
heterogeneous groupings "acute schizo- 
phrenics” and “chronic schizophrenics” 
will be discussed as relevant material be- 
cause the effects of the processes being 
discussed should make group averages 
differ in the expected directions, 

To understand the effects of response 
disorganization, it is important to note 
that the different response tendencies, 
which many schizophrenics cannot or- 
ganize adequately, may be evoked in two 
ways. First, a single stimulus may elicit 
several alternate responses. An example 
is a word in a word-association task 
which elicits several different associa- 
tions. A good example of reduced hier- 
archical organization among alternate as- 
sociates to a single stimulus was given by 
a schizophrenic: P 


My trouble is that I've got too 

thoughts. You might think about some 
let's say that ashtray and just think, oh! yes. 
that's for putting my cigarette in, but I 
would think of it and then T would think of 
a dozen different things connected with it 


at the same time [McGhie & Chapman, 1961, 
p. 108]. 


many 
thing, 


Second, competing responses may be 
evoked by different stimuli. Consider 
what happens when a person enters a 
multiple stimulus Situation. A number of 
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different stimuli are observed, cach pre- 
sumably evoking a tendency to focus on 
that stimulus. Normally these competing 
focusing responses would be hierarchically 
ordered in accordance with learning and/ 
or instructions ; the strongest focusing 
tendencies evoked by the kinds of stimuli 
which are most important for appropriate 
performance. (Harlow, 1959, had dis- 
cussed his research on learning set, m 
similar terms.) Response disorganization 
would mean that these competing atten- 
tion tendencies would be less clearly or- 
ganized, that is, attention would be more 
Scattered in schizophrenics than in nor- 
mals. As stated by Shakow (1962): 


It is as if, in the scanning process which 
takes place before the response to a stimulus 
is made, the schizophrenic is unable to selec 
out the material relevant for optimal it 
sponse. He apparently cannot free himse ^ 
from the irrelevant among the numerous pos 

sibilities available for choice [p. 9]. 


Attention to irrelevant stimuli increases 
the likelihood that additional competing 
Tésponse tendencies are evoked. 50 ga 
problem is compounded for the disorgav" 
ized schizophrenic, He not only ‘has y 
normal variation among competing 7 
Sponse tendencies, but also, because " 
his scattered attention, he is likely to ae 
à greater number of competing i ue 
tendencies. Exceptions to this statem? 
will be discussed in the next section. 


NARROWED OBSERVATION 


The second major principle is that i 
schizophrenics, primarily nonparam? 
chronics, learn to reduce interfering We 
Sponses through reduced observation. ng 
have discussed lack of organization ay 
competing response tendencies evoke 
different stimuli as one major puc 
interfering responses in schizophre re- 
Reduced observation should tend t9 jon. 
duce this source of response confusi 
Reduced observation is a process There" 
would be learned over time and T Re- 
fore expected primarily in chronics. 
duced observation would also pa um 
Primarily by those schizophrenics 
most need reduced response interfe 
and for whom reduced vigilance 15 
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c " r 
E e DER there is some 
He one caret is 4 t t paranoids may 
than other iiu iss a 
and ':bécause oe se a ( Lester, 1960),? 
comfortable Ju sion € =e 
reduced observ 7 reduced vigilance, that 
a r gee is expected primarily 
will be mie H chronies: New situations 
Piece. a poitea by these chronic schizo- 
Bssrvati tis just that their breadth of 
erus. ion is not as great. Reduced ob- 
mra would, of course, decrease re- 
ii complexity only when the alternate 
eee were stimulated by different 
M Shel not when they are responses to 
Single stimulus which is observed. 
Following Broadbent (1958), it is hy- 
den soar that observation is not reduced 
ie mly. Instead. observation is guided 
diac the lines of discriminable stimulus 
tion es (e.g. classes based on color, posi- 
mod. d. stimuli from a single sensory 
ei. ity), with some classes ignored = 
ity nn observed normally. The probabil- 
hat any particular stimulus class wil 

€ noticed is a function of such stimulus 
characteristics as: physical intensity 
(Broadbent, 1958) ; central position; and 
relian will be called stimulus pul » 
Sn lonship between stimulus pu hae 
in nulus relevance is then quite importan 
scl determining the quality of chronic 
Tzophrenics' performance in any partic- 
ular task, [n discussing past research. 
Stimulus “pull” will be defined in terms of 
What would be commonly expected ; for 
Sxample, other things being equal. stimuli 
Which are central in à stimulus display 


h saat ee 
ave stronger “pull” than stimuli which 
ure research 


y peripheral. However, fut ^ 
xu use better definitions. One adequa 
"finition would be obtained by 
wie does not mean that schizophrene 
ra, Paranoid delusions are not disorgan ns 
are &mented and possibly paranoid a m 
spong rected in the context of gena ^. 
1966)" disorganization (Broen sp m 
that a However, it seems to be S 
tends diagnosis of paranoid “© 
tured to be applied when relati 
i Paranoid delusions are mor 
sore symptom picture ‘and general 
Eanization is somewhat less. 


showing 


vely St 
prominent 


response 


the research situation to a reference group 
from the same population as an experi- 
mental group, having instructed the ref- 
erence group to "tell me (or point to. 
etc.) what you notice first" The fres 
quencies that different stimulus classes 
are indicated could define relative stimu- 
lus pull. 

There is considerable research support 
for the idea that chronic nonparanoid 
schizophrenics use a subnormal range of 
stimuli (reviewed in Venables, 1964), al- 
though whether the reduction is early in 
the attention process, as discussed here, 
or occurs later, is an open question. The 
question of causality is also more specula- 
tive than the principle that reduced obser- 
vation does occur. The hypothesis that 
reduced observation is an attempt to cope 
h an increase in response interference 
cing the sources of stimulation is 
o ideas in discussions of schizo- 
Freeman (1960), and Zahn, 
d Shakow (1963). This no- 
tion that narrowed observation is a re- 
action to response interference is sup- 
ported by Bursill’s (1958) results indi- 
cating that, in a complex visual signal 
monitoring task, observation tends to be 
narrowed to more central stimuli when 
perceptual load is increased. (Bursill's 
research used normal subjects. Chronic 
schizophrenics are hypothesized to have 
learned more constant and extreme re- 
ductions than the temporary response to 
certain situations seen in normals.)* 


wit 
by redu 
similar t 
phrenia by 
Rosenthal, an 


has also written an 
f some research on at- 
using the concepts 
"sensory input 
» which have 
o "breadth of observation." 


ome similarity t f atio 
He also used a “field articulation” principle 
«mates the inverse of "attention 
pose of our 


which approx nate aioe pum 
type of concepts 
964) and Vena- 
ept of “response dis- 
ful in account- 
hizophrenics 


3 Silverman (1964) 
cussion O 


tention in schizophrenia, 
ing" and 


4) to the conc! 
16n » which seems use 
research on sc 


T 
rms, 1966). 

tement does not 
normals 


contradict evi- 
consistent indi- 
ask stimuli 
at, in addi- 


Broen 
4 This sta 


582 


Venables (1964) suggested that the re- 
duced use of stimuli found in chronic 
schizophrenics is the result of increased 
arousal. This suggestion does not neces- 
sarily contradict our discussion in that 
there are several studies which suggest 
that increased arousal increases response 
disorganization in schizophrenics (Broen 
& Storms, 1966). Callaway and Stone 
(1960) have also suggested that arousal 
may lead to greater equivalence of prob- 
abilities of competing responses which 
people attempt to handle by reduced ob- 
servation. 


SUMMARY Propositions 


What has been implied thus far can be 
summarized as three propositions. Each 
proposition is based on processes which 
are hypothesized to affect major sub- 
groups of the specified groups, and should 
be understood as referring to average 
group responding : 

1. When a task has a single outstand- 
ing relevant stimulus which evokes sev- 
eral response tendencies, normals and 
schizophrenics will tend to have the same 
dominant and competing responses, but 
both acute and chronic Schizophrenics 
will show more disorganized variation 
among these responses than normals. 
One area of empirical support for this 
proposition has been found when word- 
association tasks are repeated. Acute 
schizophrenics have greater variation in 
their associations than normals (Storms 
& Broen, 1964), and the same abnormal 
response instability has been found in 
poor premorbid chronics (Dokecki, Poli- 
doro, & Cromwell, 1965). 

2. In multiple stimulus tasks, acute 
Schizophrenics will still be near normal 
in their breadth of observation and in the 
stimuli they observe, but their response 
disorganization means that attention hier- 
archies will be less organized than is the 


tion to these consistent individual differences, 
normals show temporary reductions in 
breadth of observation under increased per- 
ceptual load. Because attention disorganiza- 
tion (which increases perceptual load) is 
more constant and extreme in Schizophrenics, 
chronics learn consistent and more extreme 
narrowing of observation. 
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case for normals. Acute schizophrenics’ 
attention will be abnormally scattered 
among the stimuli they have observed. 
Approximately normal breadth of obser- 
vation together with disorganized atten- 
tion means that acutes have attention 
scattered over a greater range of stimuli 
than normals or chronic schizophrenics. 

3. In multiple stimulus tasks, chronic 
schizophrenics (especially nonparanoids) 
do not observe as many stimulus classes 
as normals or acute schizophrenics. The 
Proportion of irrelevant to relevant stim- 
uli observed by chronics depends on 
whether or not relevant and irrelevant 
stimuli are from different, easily dis- 
criminable categories, and on the relation- 
Ship between stimulus pull and relevance. 
Like acutes, chronics have abnormal re 
sponse disorganization so their pes 
Will be abnormally variable, but only 
within the limited number of stimuli they 
have observed, 


Two KINDS or OvERINCLUSION 


We can now turn to the apparently E 
tradictory statements, Statement A 
“Overinclusive concepts have been S 
in chronic Schizophrenics," refers to t i 
results of a task where single an 
(words) were sorted into one of 61) 
conceptual classes (Chapman, 19 aft 
From Proposition 1 above, if sting 
evoke both correct and incorrect sorti re 
tendencies, chronics should make d 
Sorting errors than normals, that T 
should overinclude some words in 2 e 
rect classes, This is exactly what Sta i 
ment la means. For example, in one P 
of the study, subjects were asked to arts 
the names of fruit, vegetables, and ree 
equipment into two classes: “fruit ples 
“not fruit.” Sorting names of vu 
was most difficult. For normals the a e 
nant response was to place vegetables C^, 
rectly in the “not fruit" category ral- 
presumably because of mediated. ms e 
ization there was also a competing pr 
ency to place vegetables in the fruit hre- 
gory. Like normals, chronic med 
nics made most of their errors in rene 
these objects which elicit comp had 
responses. Chronic schizophrenics s- 
the same dominant and competing 


r 
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sponses as normals, the difference was 
simply that the probabilities of domi- 
nant and competing responses were 
closer together for the chronics. In 
other words, chronics included more 
vegetables in the fruit category. This 
kind of overinclusion is simply disorgani- 
zation among the alternate responses to 
the stimulus in single-stimulus tasks. As 
has been noted, acute schizophrenics also 
show disorganized responding in single 
stimulus-multiple response tasks, and 
similar results would be expected for 
acutes on this type of Chapman sorting 
task. Downing, Ebert, and Shubrooks 
(1963) showed that acutes tended to make 
the same kinds of errors as Chapman's 
chronics did on a grossly similar sorting 
task, but some task differences and the 
lack of results from a normal group make 
conclusions uncertain. 

The meaning of overinclusion in State- 
ment 1b is different. This statement 
refers to the number of different kinds of 
stimuli which are used in a multiple-stim- 
ulus task (Payne, 1962). Subjects were 
shown a number of objects and were 
asked to sort the objects repeatedly into 
groups that belong together, using as 
many different simple reasons for group- 
ing the objects as they could think of. 
There were many possible stimulus cate- 


gories which could be used. Normals 


tended to focus on stimuli such as shape 
only rarely 


of objects, color, or weight ; 

using characteristics such as shape of 
shadows cast by the objects, or whether 
or not the objects were scratched. (With 
nondirective task requirements, attention 
was organized according to past learning 
about the utility of different stimulus 
classes.) For acute schizophrenics, nor- 
mal observation together with not having 
as hierarchically organized attention 
should lead to responding to more stimuli 
than normals—using the stimuli that nor- 
mals focus on, and also using the stimuli 
that tend to be ignored by normals 
(Proposition 2). However, because 
acutes are expected to use more stimuli 
than normals, boredom or fatigue effects 
must be considered. To the extent that 
these factors affect performance, we 
would not match normals in the use of the 


583 


stimuli on which normals focus. The 
results (Payne, 1962) fit these expecta- 
tions. Acutes produced relatively fewer 
sortings based on the kinds of stimuli 
normals tended to focus on, and more 
sortings based on stimuli that normals 
used only rarely (first part of Statement 
1b). Overall, acutes used more types 
of stimuli than normals. H 
Chronic schizophrenics, with reduced 
range of observation, would have fewer 
kinds of stimuli eliciting competing focus- 
ing responses. Chronics have the basic 
schizophrenic response disorganization, 
but attention should be scattered only 
within the limited range of stimuli which 
are noted. In the Payne task, the 
stimuli which normals used most, and 
those stimuli normals used rarely, both 
included stimulus classes with relatively 
little stimulus pull. Reduced observa- 
tion should, therefore, reduce noticing 
both kinds of stimuli (Proposition 3). 
Thus, one effect of reduced observation 
should be that chronics will show sub- 
normal use of those stimuli on which nor- 
mals usually focus. Payne (1962) found 
this reduced use of normally dominant 
stimuli in chronics. The picture should 
be different for stimuli that normals 
rarely focus on. For these stimuli, re- 
sponse disorganization and reduced ob- 
servation should counteract each other in 
effect on differences between normals and 
chronics; reduced observation in chronics 
being offset by normals tending to focus 
on other stimuli. This is also consistent 
with Payne's data. Chronics did not use 


abnormally many unusual siii ve 
i i 

-istics as bases for sorting. ^. 7. 

acteristics as ba dor am 


what is meant by lack of overi 
chronics in Statement 1b. 


NARROW ATTENTION AND DIS- 
TRACTIBILITY IN CHRONIC 
ScHIZOPHRENICS 
that reduced range of ob- 
reduced organization of 
i nses both occur in 
competing  TeSpon** : 
Meet schizophrenics may clarify oe 
rently contradictory statements at 
nies have narrowed attention, and 
EE are abnormally distractible 


Recognizing 
servation and 
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(Statements 2a and b). Research results 
depend on the research task. The extent 
of the chronic’s distractibility, relative to 
normals, depends on the likelihood that 
his limited observation will include stim- 
uli which are considered to be distractors. 

When the stimuli with strongest pull 
are relevant to task requirements, and 
other relevant and irrelevant stimuli are 
weaker, chronics will observe the stronger 
(relevant) stimuli; their subnormal ob- 
servation only reflected in ignoring 
peripheral stimuli. Attention will be 
scattered only among the relevant stimuli 
that are observed. In the sense of re- 
duced use of peripheral stimuli, chronics 
will not be more distractible than normals. 
This does not mean that chronics will do 
better than normals even when relevant 
stimuli are few and all have strong pull, 
In such situations normal broad observa- 
tion followed by appropriate attention 
focus should offset the chronic’s reduced 
observation of irrelevant stimuli. The 
clearest predictions regarding differences 
between normals and chronic schizo- 
phrenics are: (a) chronic schizophrenics 
should do relatively better when relevant 
stimuli have stronger pull than when all 
relevant stimuli are peripheral (eg. 
chronics should show less impairment in 
directed intentional learning than in in- 
cidental learning) ; and (5) when rele- 
vant stimuli have strong pull but addi- 
tional relevant stimuli are peripheral, 
chronics should do poorer than normals 
(e.g. in distance-constancy experiments). 
These predictions are consistent with 
considerable evidence which has been dis- 
cussed in detail by Venables (1964). In 
these studies, the chronic's Observation 
tends to be abnormally restricted to the 
relevant stimuli with strong pull, and the 
basic difference from normals is that 
chronics ignore weaker relevant stimuli 
Which normals, with their broad observa- 
tion followed by appropriate focus of at- 
tention, do use. This is what is meant by 
Statement 2b. The chronic’s tendency to 
focus on what is relevant, together with 
ignoring other stimuli that are used by 
normals, is what may seem inconsistent 
with saying that chronic schizophrenics 
are distractible, 


THEORETICAL Notes 


Chronic schizophrenics will be more 
distractible when there is little relation- 
ship, or a negative relationship between 
stimulus pull and stimulus relevance. In 
such cases the ratio of observation of 
relevant to irrelevant stimuli will not be 
high, and because reduced observation 
follows lines of stimulus pull, this ratio 
will be unchanged or made worse by re- 
duced observation, Because of task re- 
quirements, in normals the initial observa- 
tion will be followed by attention hier- 
archies which favor relevant stimuli. 
Attention will be more disorganized 1n 
chronics, leading to distraction by the 
relatively high proportion of irrelevant 
stimuli among the stimuli observed. 
Situation with little relationship between 
stimulus pull and relevancy is illustrated 
by another card-sorting task used by 
Chapman (1956), On each trial normals 
and chronic schizophrenics were 1- 
Structed to place a response card with 
one of three sorting cards on the basis ° 
figures on the response and sorting cards 
having a concept im common or being 
identical. There were three figures 0” 
each card. Subjects were instructed t? 
use only the upper right figure on the 
response card although all three figures 
on each of the sorting cards were rele- 
vant. Note that reduced observation 
would not favor relevant stimuli on “n 
basis of type of stimulus, because rele 
stimuli could be any type of figure. Cor 
fining observation to relevant stimuli 0” 
the basis of position was also unlikely b& 
cause different stimuli were close í? 
gether, relevant stimuli were not in cef" 
tral position, and relevant stimuli wer 
in different positions in response car 
than on sorting cards, Thus, reduce 
Observation would do little to Ld 
noticing relevant as compared to irrele 
vant stimuli, and both normals po 
chronic schizophrenics should observ" 
irrelevant as well as relevant stim 
uli. However, the more organize p 
Sponse hierarchies in normals would ee 
two things: (a) provide more attenti e 
focus on relevant stimuli; and (b) Er 
clearer dominance of appropriate d IE" 
Competing sorting tendencies if both F 
vant and irrelevant commonalities 4 


L- 


- 
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a on a single trial. So normals 

Decet Ls ot (Chapman, 1956), make 

This is IE ased on irrelevant stimuli. 

eod dics meaning of ‘chronic schizo- 

mals” in a more distractible than nor- 
atement 2b. 
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ON THE INCOMPATIBILITY OF THE HOUSTON AND > 
OSGOOD TRANSFER SURFACES 


CHARLES P. THOMPSON 


Kansas State University 


A comparison of the Houston and Osgood transfer surfaces demon- 


strates that they are incompatible. 


Osgood’s (1949) transfer surface pre- 
dicts the direction and magnitude of 
transfer as a function of the degree of 
S,S, and R,-R, similarity. Recently, 
Houston (1964) has suggested a re- 
labeling of this surface using the degree 
of S,-R, and S,-R, similarity for the 
coordinates. The two surfaces are shown 
in Figure l. Since Houston restricts the 
application of his surface to formal simi- 
larity, only that portion of Osgood's sur- 
face applicable to formal similarity is 


HOUSTON 


TRANSFER: o 


OscooD 


TRANSFER o 


Fic. 1l. The Houston and Osgood transfer 
surfaces. (On both surfaces, zero—0—rep- 
resents no similarity and one—1—represents 
identity.) 
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shown; that is, the extreme right portion 
of Osgood’s surface representing response 
antagonism has been omitted. E 
These surfaces invite comparison, since 
both represent important attempts to sum 
marize the effects of a potent variable 1 — 
transfer. However, such comparison 
demonstrates that the two surfaces aa 
incompatible This incompatibility ! 
perhaps most strikingly evidenced n 
those instances in which some degree te 
similarity exists for at least one of th 
dimensions in both surfaces. Such w 
stances may be produced by using br 
lists, two of which are totally dissimila, 
(hereafter designated as Lists A and 
and a third which is similar to both 2 
the other lists (hereafter designated ^ 


List A/B). Sample lists of this tyP® 
(using bigrams) are given in Table 1. 
TABLE 1 b 
SawrLE Bicram Lists or tHe A, By AN 
A/B Tyee 
List A List B List A/P - 
AM WH WA 
RT GH GR 
KE FI FE 
XP SC SP 
QU CL UL 
AB LD AD 
EX IG IE 
PR OW E 
BU NS ies 
TY FO 9 


r a5 
1It should be noted that the did re^ 
sumes that both transfer surfaces 477 C. 


dicting performance relative to an A- enable 
control paradigm. This seems reas“ çD " 
since both surfaces indicate the A-P: 


paradigm as a zero-transfer condition 
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TABLE 2 


TRANSFER ParapicMs Usinc A, B, AND A/B 
Lists AS STIMULI AND RESPONSES 


Dm Houston Osgood 
A-B = r3 
A/B-B 
AB + E 
A/B-A 
A/B-A 0 fe 
B-A 
A-A/B ge 0 
B-A 
B-A ? 0 
A-A/B 
B-A/B ? 0 
A-A/B 
B-A 0 ? 
B-A/B 
B-A/B —(Max) ? 
A/B-A 
A/B-B 0 ? 
A-A/B 
A/B-B ? = 
B-A 
A/B-B ? —(Max) 
A/B-A 
B-A/B - t 
B-A 
A-A/B +(Max) ? 
A/B-A 
indicate the direction of 
voters Sleted by find ar and Osgood. 
i B lists as the stim- 
m a pom can construct à 


uli and responses, 7 
al of 13 functionat y. ct 
ET not including trivial cases 


i he restriction 
as A-A which meet t esti 

d mi Pub of similarity obtaining for 

EA f the dimensions im both 
These patterns are 

r E i» 
iere Table 2. Of these, there are Ar 
conditions for which Houston and Osgoo 


distinct transfer 
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clearly make contradictory predictions. 
In two of these cases, one author predicts 
positive transfer while the other predicts 
negative transfer. The first of these two 
cases is the transfer paradigm A-B, 
A/B-B in which List A is used for the 
S,'s, List A/B is used for the S,’s, and 
List B is used for the R,’s and R,’s. In 
this case, Osgood predicts positive trans- 
fer since the responses are identical and 
the stimuli are similar. Houston, on the 
other hand, predicts negative transfer 


TABLE 3 


TRANSFER Paranicms Usixc A, B, A/B, AND 
C Lists AS STIMULI AND RESPONSES 


EE Ee Houston Osgood 
C-A/B E 0 
A-B 
C-A E 0 

A/B-A 
C-A — (Max) 0 
A-A/B 
A-C 0 = 

A/B-A 

A/B-C 0 - 
A-B 

A-C 0 —(Max) 
A-A/B 

A-A/B - —(Max) 
A-C 

A/B-A —(Max) - 
A-C 
A-B = = 

A/B-C 
A-B 0 0 
C-A/B 
A-A/B 0 0 
C-A 

A/B-A 0 0 
C-A 


d 3 indicate the direction of 
Note Coad by Houston and Osgood. 


transfer predicted by 
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; re is no similarity between S, 
EE S, and R, are similar. The 
second transfer paradigm for which pei 

ite effects are predicted is the A-B, 
pa B-A condition. Here, since the re- 
ae are totally dissimilar and the 
stimuli are similar, Osgood predicts nega- 
tive transfer. However, since S, and R, 
are identical and S, and R, are similar, 
Houston predicts positive transfer. 

Indeed, an inspection of these 13 trans- 
fer paradigms demonstrates that, in all 
cases, Houston and Osgood predict differ- 
ent points on the transfer surface, How- 
ever, since (a) a large portion of the 
transfer surface predicts no transfer, (b) 
assumptions about location must be made 
at points other than identity or total 
dissimilarity, and (c) the Surfaces are 
obviously not of equal magnitude (e.g., 
consider the point of maximum transfer 
for both surfaces), Houston and Osgood 
do not necessarily make contradictory 
predictions in all such instances, 

Adding another dissimilar list (here- 
after designated List C) Senerates, under 
the restrictions noted above, 12 more 
functionally distinct transfer conditions. 
These paradigms are listed in Table 3. 
There are two Points to be noted for 


these conditions: (a) The A-B, A/B-C 
and A-B, C-A/B i 
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their location on the transfer e pe 
and (b) while Houston and Osgood bori 
predict negative transfer for the ie » 
A-C and A/B-A, A-C paradigms, they 
disagree as to which pattern will produce 
the most negative transfer. i 

While this note has been — 
with pointing out the incompatibility o 
these two transfer surfaces, it should also 
be noted that both surfaces have obvious 
defects. For example. Osgood predicts 
no transfer in the A-B, B-A ager 4 
(e.g., positive transfer found by Murdock, 
1956) while Houston predicts that learn- 
ing (ie, A-B, A-B) will not take Blase 
Nevertheless, the relationships suggeste 
by the surfaces are important, and re- 
search investigating these da qure 2 
(and, hopefully, utilizing some of thé 
Paradigms suggested in this paper) 
should make an important contribution to 
our understanding of transfer. 
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